首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Rookognise: Acoustic detection and identification of individual rooks in field recordings using multi-task neural networks
Institution:1. Department of Zoology, University of Calcutta, Kolkata, India;2. Department of Zoology, Shibpur Dinobundhoo Institution (College), Shibpur, Howrah, India;1. Biodiversity Centre, Finnish Environment Institute, Latokartanonkaari 11, FI-00790 Helsinki, Finland;2. Finnish Meteorological Institute, Weather and climate change impact research, P.O. Box 503, FI-00101 Helsinki, Finland;3. Department of Geosciences and Geography, University of Helsinki, P.O. Box 64, Gustaf Hällströminkatu 2a, 00014 Helsinki, Finland;1. National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, Hefei 230601, China;2. School of Information Science and Technology, Fudan University, Shanghai 200433, China;1. Department of Earth Sciences, Annamalai University, Tamil Nadu 608002, India;2. Birbal Sahni Institute of Palaeosciences, 53 University Road, Lucknow 226 007, India;3. Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
Abstract:Individual-level monitoring is essential in many behavioural and bioacoustics studies. Collecting and annotating those data is costly in terms of human effort, but necessary prior to conducting analysis. In particular, many studies on bird vocalisations also involve manipulating the animals or human presence during observations, which may bias vocal production. Autonomous recording units can be used to collect large amounts of data without human supervision, largely removing those sources of bias. Deep learning can further facilitate the annotation of large amounts of data, for instance to detect vocalisations, identify the species, or recognise the vocalisation types in recordings. Acoustic individual identification, however, has so far largely remained limited to a single vocalisation type for a given species. This has limited the use of those techniques for automated data collection on raw recordings, where many individuals can produce vocalisations of varying complexity, potentially overlapping one another, with the additional presence of unknown and varying background noise. This paper aims at bridging this gap by developing a system to identify individual animals in those difficult conditions. Our system leverages a combination of multi-scale information integration, multi-channel audio and multi-task learning. The multi-task learning paradigm is based the overall task into four sub-tasks, three of which are auxiliary tasks: the detection and segmentation of vocalisations against other noises, the classification of individuals vocalising at any point during a sample, and the sexing of detected vocalisations. The fourth task is the overall identification of individuals. To test our approach, we recorded a captive group of rooks, a Eurasian social corvid with a diverse vocal repertoire. We used a multi-microphone array and collected a large scale dataset of time-stamped and identified vocalisations recorded, and found the system to work reliably for the defined tasks. To our knowledge, the system is the first to acoustically identify individuals regardless of the vocalisation produced. Our system can readily assist data collection and individual monitoring of groups of animals in both outdoor and indoor settings, even across long periods of time, and regardless of a species’ vocal complexity. All data and code used in this article is available online.
Keywords:Deep learning  Bird recognition  Outdoor setting  Vocalisation-independent  Corvid  Multi-channel audio
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号