DocumentCode
706130
Title
Speech — Nonspeech discrimination based on speech-relevant spectrogram modulations
Author
Wohlmayr, Michael ; Markaki, Maria ; Stylianou, Yannis
Author_Institution
Comput. Sci. Dept., Univ. of Crete, Heraklion, Greece
fYear
2007
fDate
3-7 Sept. 2007
Firstpage
1551
Lastpage
1555
Abstract
In this work, we adopt an information theoretic approach - the Information Bottleneck method - to extract the relevant modulation frequencies across both dimensions of a spectrogram, for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate the effectiveness of a simple thresholding classifier which is based on the similarity of a sound to each characteristic modulation spectrum. When we assess the performance of the classification system at various SNR conditions using F-measure, results are equally good to a recently proposed method based on the same features but having significantly greater complexity.
Keywords
feature extraction; signal classification; speech processing; F-measure; characteristic modulation spectrum; compact representation; information bottleneck method; information theoretic approach; maximally informative features; modulation frequencies; sound ensemble; spectrogram; speech-non-speech discrimination; thresholding classifier; Complexity theory; Feature extraction; Modulation; Signal to noise ratio; Speech; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2007 15th European
Conference_Location
Poznan
Print_ISBN
978-839-2134-04-6
Type
conf
Filename
7099066
Link To Document