Title :
Chime-home: A dataset for sound source recognition in a domestic environment
Author :
Peter Foster;Siddharth Sigtia;Sacha Krstulovic;Jon Barker;Mark D. Plumbley
Author_Institution :
School of Electronic Engineering and Computer Science, Queen Mary University of London, UK
Abstract :
For the task of sound source recognition, we introduce a novel data set based on 6.8 hours of domestic environment audio recordings. We describe our approach of obtaining annotations for the recordings. Further, we quantify agreement between obtained annotations. Finally, we report baseline results for sound source recognition using the obtained dataset. Our annotation approach associates each 4-second excerpt from the audio recordings with multiple labels, based on a set of 7 labels associated with sound sources in the acoustic environment. With the aid of 3 human annotators, we obtain 3 sets of multi-label annotations, for 4378 4-second audio excerpts. We evaluate agreement between annotators by computing Jaccard indices between sets of label assignments. Observing varying levels of agreement across labels, with a view to obtaining a representation of ‘ground truth’ in annotations, we refine our dataset to obtain a set of multi-label annotations for 1946 audio excerpts. For the set of 1946 annotated audio excerpts, we predict binary label assignments using Gaussian mixture models estimated on MFCCs. Evaluated using the area under receiver operating characteristic curves, across considered labels we observe performance scores in the range 0.76 to 0.98. Dataset URL: http://archive.org/details/chime-home
Keywords :
"Acoustics","Speech","Speech processing","Audio recording","Conferences","Speech recognition"
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
DOI :
10.1109/WASPAA.2015.7336899