A domain and noise adversarial bird tune classification pipeline using deep neural network
Abstract
Birds are an important category of animals that ecologists keep track of utilizing
autonomous recording units as a key indication of environmental health. Because
of the consequences of climate change and the rising number of endangered species,
many experts suggested developing an animal species recognition system to help
them in specialized research. Researchers can improve their ability to assess the
state of biodiversity and its patterns in crucial ecosystems by precise sound detection and categorization, which is supported by machine learning, allowing them to better support global conservation efforts. However, producing analysis outputs
with high precision and recall remains a difficulty. Due to a lack of appropriate
methods for efficient and accurate extraction of interest signals, the vast bulk of
data remains unexplored (e.g., bird calls). Moreover, due to strong source-domain specific features and artificial/natural noises, these acquired raw data create different distributions in datasets. So, to ensure a generalized feature learning, domain adaptation [1] techniques will be implemented in this work to make the networks familiar towards both acquisition sensor noises and background noises without having to do intensive dataset specific augmentations. We used 3 popular and powerful
DNN models, including CNN, VGG19 and ResNet50. Out of them, for the bird
species classification task VGG19 achieved the best accuracy of 96.02% in testing and 94.01% in training. To the best of our knowledge, this will guide towards convenient and deployable in real life models which will allow future works into the pipeline to ensure better coverage.