Title :
Production features for detection of shouted speech
Author :
Mittal, Vinay Kumar ; Yegnanarayana, B.
Author_Institution :
Speech & Vision Lab., Int. Inst. of Inf. Technol., Hyderabad, India
Abstract :
Shouted speech or screaming signals have been studied mostly through spectral representation such as melcepstral coefficients. Intuitive evidence that the characteristics of the excitation source may vary in the case of shouted speech has drawn little attention yet. In this paper we examine how the characteristics of both components of speech production mechanism, especially the glottal excitation source, are modified during the production of shout signals. Shouted and normal speech signals are examined along with the corresponding Electro-glotto-graph (EGG) signals. Distinguishing features like the dominant frequency and the strength of excitation are explored, along with the instantaneous fundamental frequency. These features are computed using linear prediction analysis and zero frequency filtering of the speech signal. Efficacy of these features in discriminating between shouted and normal speech is tested in five different vowel contexts.
Keywords :
signal detection; speech processing; EGG signals; electroglotto-graph signals; excitation source; linear prediction analysis; melcepstral coefficients; screaming signals; shout signal production; shouted speech detection; speech production mechanism; vowel contexts; zero frequency filtering; Context; Feature extraction; Production; Shape; Spectrogram; Speech; Vibrations; EGG; LP analysis; ZFF; dominant frequency; shout detection; shouted speech; shouts; zero-frequency filtering;
Conference_Titel :
Consumer Communications and Networking Conference (CCNC), 2013 IEEE
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-3131-9
DOI :
10.1109/CCNC.2013.6488433