Title :
Sound recognition: a connectionist approach
Author :
Harb, Hadi ; Chen, Liming
Author_Institution :
Dept. Mathematiques Informatiques, Ecole Centrale de Lyon, Ecully, France
Abstract :
This paper presents a general audio classification approach inspired by our modest knowledge about the human perception of sound. Simple psychoacoustic experiments show that the relation between short term spectral features has a great impact on the human audio classification performance. For instance, short term spectral features extracted from speech sound can be perceived as non-speech sounds if organized in a special way in time. We have developed the idea of incorporating several consecutive spectral features when modelling the audio signal in relatively long term time windows. The modelling scheme that we propose, piecewise Gaussian modelling (PGM), was combined with a neural network to develop a general audio classifier. The classifier was evaluated on the problems of speech/music classification. male/female classification and special events detection in sports videos. The good classification accuracy obtained by the classifier suggests us to continue the research in order to improve the model and to closely combine it to some well-known psychoacoustic experimental results.
Keywords :
Gaussian processes; audio signal processing; feature extraction; modelling; neural nets; signal classification; speaker recognition; audio classifier; audio signal; feature extraction; general audio classification approach; human audio classification; neural network; piecewise Gaussian modelling; psychoacoustic experiments; sound recognition; Context modeling; Event detection; Feature extraction; Frequency; Humans; Loudspeakers; Neural networks; Psychoacoustic models; Psychology; Speech;
Conference_Titel :
Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
Print_ISBN :
0-7803-7946-2
DOI :
10.1109/ISSPA.2003.1224953