banter is a package for creating hierarchical acoustic event classifiers out of multiple call type detectors.
To install the latest version from GitHub:
# make sure you have devtools installed
if(!require('devtools')) install.packages('devtools')
# install package from GitHub
devtools::install_github('ericarcher/banter')
The BANTER (Bio-Acoustic eveNT classifiER) model is initialized with a data frame of events. There is one row per event and it must have a column called event.id
which is a unique id for each event, and a column called species
which assigns each event to a given species. Every other column in the data.frame will be used as a predictor variable for the events.
In the package, an example data.frame is in the train.data
example data list as the $events
element.
data(train.data)
bant.mdl <- initBanterModel(train.data$events)
Next, detector data is added to the initialized BANTER model object. Each detector is a data.frame with a column called event.id
that associates the detected call with an event that the model was initialized with, and a call.id
column that provides a unique identifier for each call. Every other column will be used as a predictor variable for the calls.
In the package, example data.frames for three detectors are provided in the $detectors
element of the train.data
example data list. Here is an example of adding the burst pulse (bp
) detector.
bant.mdl <- addBanterDetector(
bant.mdl,
data = train.data$detectors$bp,
name = "bp",
ntree = 10,
sampsize = 1
)
The addBanterDetector
function can be called repeatedly to add additional detectors. Alternatively, if the detectors are all in a named list, they can be added at once:
bant.mdl <- addBanterDetector(
bant.mdl,
data = train.data$detectors,
ntree = 10,
sampsize = 1
)
Once all of the detectors have been added, then the full BANTER model is run:
bant.mdl <- runBanterModel(bant.mdl, ntree = 5000, sampsize = 3)
The model can be easily summarized:
summary(bant.mdl)
The actual randomForest
model can be extracted for the event or detector models:
# extract event Random Forest model
event.rf <- getBanterModel(bant.mdl, "event")
# extract burst pulse (bp) Random Forest model
bp.rf <- getBanterModel(bant.mdl, "bp")
These can then be visualized using other tools, such as those in the rfPermute
package:
library(rfPermute)
plotVotes(event.rf)
To predict novel data, it must be in a list with the event data in the $events
element, and the detector data in a named list called $detectors
:
data(test.data)
predict(bant.mdl, test.data)
- submit suggestions and bug-reports: https://github.com/ericarcher/banter/issues
- send a pull request: https://github.com/ericarcher/banter/
- e-mail: [email protected]
Rankin, S., Archer, F., Keating, J. L., Oswald, J. N., Oswald, M., Curtis, A. and Barlow, J. (2017) Acoustic classification of dolphins in the California Current using whistles, echolocation clicks, and burst pulses. Mar Mam Sci, 33: 520-540. doi:10.1111/mms.12381
- changed behavior of
addBanterDetector()
to provide a warning instead of an error if a detector didn't have enough detections of a species and add all detectors that were valid. - minor bug fixes and edits for CRAN
- add
subsampleDetections()
to draw a random number of detections per event - fixed
predict()
so that species names innew.data
are modified usingmake.names()
like they are in constructing the initial banter model. - added run timestamps to all models and changed
summary()
to display a matrix of all. - fixed trace and inbag plots in
summary()
andplotDetectorTrace()
.
- Detector names and detector predictor column names submitted to
addBanterDetector()
are now first checked to make sure they are syntactically valid and unique by comparing them with the results frommake.names()
. - Detector names and detector predictor column names in
new.data
submitted topredict()
are first checked to make sure at least one detector from model is present. - Detectors missing in
new.data
forpredict()
will automatically have all species detector probabilities and detector proportions set to 0. - Added validation option to
predict()
ifspecies
column exists innew.data
- Default value for num.cores has been set to 1.
- Added
banterGuide()
- Initial CRAN release