DocumentCode :
626518
Title :
A detection method of nasalised vowels based on an acoustic parameter derived from phase spectrum
Author :
Shahnaz, Celia ; Najnin, Shamima ; Fattah, Shaikh Anowarul ; Wei-Ping Zhu ; Ahmad, M. Omair
Author_Institution :
Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
fYear :
2013
fDate :
19-23 May 2013
Firstpage :
297
Lastpage :
300
Abstract :
In this paper, a phase spectrum based acoustic parameter is presented for the detection of nasalized vowels from the mixture of oral and nasalized vowels of normal speakers. Acoustic analysis shows that during the event of nasalization, although additional formants (resonances) at various frequency locations are introduced, the introduction of a new formant in low frequency region around 250 Hz is found to remain consistent irrespective of female or male speakers in the modified group delay derived from the phase spectrum. By exploiting and verifying this fact on the band-limited modified group delay spectrum capable of resolving two closely spaced formants, an acoustic parameter RMGD is derived. Utilizing RMGD, the problem of detecting nasalized vowels is solved based on a threshold based scheme or a Euclidean distance based classifier. Simulation Results on TIMIT database show that the proposed method even with a simple classifier is superior in performance in comparison to that of the methods using Mel-frequency cepstral coefficients as a feature and Hidden Markov Modeling or Support Vector Machine as a classifier.
Keywords :
acoustic transducers; delays; hidden Markov models; pattern classification; speaker recognition; support vector machines; Euclidean distance based classifier; Mel-frequency cepstral coefficient; RMGD; TIMIT database; automated speech recognition; band-limited modified group delay spectrum; hidden Markov modeling; nasalised vowel detection method; phase spectrum based acoustic parameter analysis; speaker; support vector machine; threshold based scheme; Accuracy; Delays; Educational institutions; Euclidean distance; Mel frequency cepstral coefficient; Speech; Euclidean distance; acoustic parameter; modified group delay function; nasalized vowel; phase spectrum;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location :
Beijing
ISSN :
0271-4302
Print_ISBN :
978-1-4673-5760-9
Type :
conf
DOI :
10.1109/ISCAS.2013.6571841
Filename :
6571841
Link To Document :
بازگشت