مرکز منطقه ای اطلاع رساني علوم و فناوري - Semi-supervised learning helps in sound event classification

DocumentCode :

3144135

Title :

Semi-supervised learning helps in sound event classification

Author :

Zhang, Zixing ; Schuller, Björn

Author_Institution :

Inst. for Human-Machine Commun., Tech. Univ. Munchen, München, Germany

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

333

Lastpage :

336

Abstract :

We investigate the suitability of semi-supervised learning in sound event classification on a large database of 17 k sound clips. Seven categories are chosen based on the findsounds.com schema: animals, people, nature, vehicles, noisemakers, office, and musical instruments. Our results show that adding unlabelled sound event data to the training set based on sufficient classifier confidence level after its automatic labelling level can significantly enhance classification performance. Furthermore, combined with optimal re-sampling of originally labelled instances and iteratively learning in semi-supervised manner, the expected gain can reach approximately half the one achieved by using the originally manually labelled data. Overall, maximum performance of 71.7% can be reported for the automatic classification of sound in a large-scale archive.

Keywords :

speech enhancement; speech recognition; large-scale archive; musical instruments; noisemakers; office; optimal resampling; recognition performance enhancement; semisupervised learning; sound automatic classification; sound event classification; training set; vehicles; Acoustics; Animals; Databases; Feature extraction; Semisupervised learning; Training; Vehicles; Semi-supervised Learning; Sound Event Classification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6287884

Filename :

6287884

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3144135