DocumentCode
1724671
Title
Production features for detection of shouted speech
Author
Mittal, Vinay Kumar ; Yegnanarayana, B.
Author_Institution
Speech & Vision Lab., Int. Inst. of Inf. Technol., Hyderabad, India
fYear
2013
Firstpage
106
Lastpage
111
Abstract
Shouted speech and screaming signals have been studied mostly through spectral representations such as mel-cepstral coefficients. Intuitive evidence that the characteristics of the excitation source may vary in shouted speech has so far drawn little attention. In this paper we examine how the characteristics of both components of the speech production mechanism, especially the glottal excitation source, are modified during the production of shout signals. Shouted and normal speech signals are examined along with the corresponding electroglottograph (EGG) signals. Distinguishing features such as the dominant frequency and the strength of excitation are explored, along with the instantaneous fundamental frequency. These features are computed using linear prediction analysis and zero-frequency filtering of the speech signal. The efficacy of these features in discriminating between shouted and normal speech is tested in five different vowel contexts.
Keywords
signal detection; speech processing; EGG signals; electroglotto-graph signals; excitation source; linear prediction analysis; melcepstral coefficients; screaming signals; shout signal production; shouted speech detection; speech production mechanism; vowel contexts; zero frequency filtering; Context; Feature extraction; Production; Shape; Spectrogram; Speech; Vibrations; EGG; LP analysis; ZFF; dominant frequency; shout detection; shouted speech; shouts; zero-frequency filtering
fLanguage
English
Publisher
ieee
Conference_Titel
Consumer Communications and Networking Conference (CCNC), 2013 IEEE
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4673-3131-9
Type
conf
DOI
10.1109/CCNC.2013.6488433
Filename
6488433