DocumentCode
626518
Title
A detection method of nasalised vowels based on an acoustic parameter derived from phase spectrum
Author
Shahnaz, Celia ; Najnin, Shamima ; Fattah, Shaikh Anowarul ; Wei-Ping Zhu ; Ahmad, M. Omair
Author_Institution
Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
fYear
2013
fDate
19-23 May 2013
Firstpage
297
Lastpage
300
Abstract
In this paper, a phase-spectrum-based acoustic parameter is presented for detecting nasalized vowels in a mixture of oral and nasalized vowels from normal speakers. Acoustic analysis shows that, although nasalization introduces additional formants (resonances) at various frequency locations, the new formant introduced in the low-frequency region around 250 Hz remains consistent in the modified group delay derived from the phase spectrum, for both female and male speakers. By exploiting and verifying this fact on the band-limited modified group delay spectrum, which is capable of resolving two closely spaced formants, an acoustic parameter RMGD is derived. Using RMGD, nasalized vowels are detected with either a threshold-based scheme or a Euclidean-distance-based classifier. Simulation results on the TIMIT database show that the proposed method, even with a simple classifier, outperforms methods that use Mel-frequency cepstral coefficients as features and hidden Markov modeling or a support vector machine as the classifier.
Keywords
acoustic transducers; delays; hidden Markov models; pattern classification; speaker recognition; support vector machines; Euclidean distance based classifier; Mel-frequency cepstral coefficient; RMGD; TIMIT database; automated speech recognition; band-limited modified group delay spectrum; hidden Markov modeling; nasalised vowel detection method; phase spectrum based acoustic parameter analysis; speaker; support vector machine; threshold based scheme; Accuracy; Delays; Educational institutions; Euclidean distance; Mel frequency cepstral coefficient; Speech; acoustic parameter; modified group delay function; nasalized vowel; phase spectrum
fLanguage
English
Publisher
IEEE
Conference_Titel
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location
Beijing
ISSN
0271-4302
Print_ISBN
978-1-4673-5760-9
Type
conf
DOI
10.1109/ISCAS.2013.6571841
Filename
6571841