• DocumentCode
    626518
  • Title
    A detection method of nasalised vowels based on an acoustic parameter derived from phase spectrum
  • Author
    Shahnaz, Celia ; Najnin, Shamima ; Fattah, Shaikh Anowarul ; Zhu, Wei-Ping ; Ahmad, M. Omair
  • Author_Institution
    Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
  • fYear
    2013
  • fDate
    19-23 May 2013
  • Firstpage
    297
  • Lastpage
    300
  • Abstract
    In this paper, a phase-spectrum-based acoustic parameter is presented for detecting nasalized vowels in a mixture of oral and nasalized vowels from normal speakers. Acoustic analysis shows that, although nasalization introduces additional formants (resonances) at various frequencies, a new formant in the low-frequency region around 250 Hz appears consistently in the modified group delay derived from the phase spectrum, for both female and male speakers. By exploiting and verifying this fact on the band-limited modified group delay spectrum, which is capable of resolving two closely spaced formants, an acoustic parameter, RMGD, is derived. Using RMGD, nasalized vowels are detected with either a threshold-based scheme or a Euclidean-distance-based classifier. Simulation results on the TIMIT database show that the proposed method, even with a simple classifier, outperforms methods that use Mel-frequency cepstral coefficients as features and a hidden Markov model or a support vector machine as the classifier.
  • Keywords
    acoustic transducers; delays; hidden Markov models; pattern classification; speaker recognition; support vector machines; Euclidean distance based classifier; Mel-frequency cepstral coefficient; RMGD; TIMIT database; automated speech recognition; band-limited modified group delay spectrum; hidden Markov modeling; nasalised vowel detection method; phase spectrum based acoustic parameter analysis; speaker; support vector machine; threshold based scheme; Accuracy; Delays; Educational institutions; Euclidean distance; Mel frequency cepstral coefficient; Speech; Euclidean distance; acoustic parameter; modified group delay function; nasalized vowel; phase spectrum;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
  • Conference_Location
    Beijing
  • ISSN
    0271-4302
  • Print_ISBN
    978-1-4673-5760-9
  • Type
    conf
  • DOI
    10.1109/ISCAS.2013.6571841
  • Filename
    6571841
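
The abstract names two simple decision rules applied to the scalar parameter RMGD: a threshold-based scheme and a Euclidean-distance-based classifier. The sketch below illustrates only those two decision rules, not the paper's implementation. It assumes RMGD values have already been computed elsewhere, since the abstract does not define how RMGD is derived. The function names, the sample values, the midpoint threshold, and the convention that a larger RMGD indicates nasalization are all hypothetical choices made for illustration.

```python
from statistics import mean


def fit_class_means(values, labels):
    """Return the mean feature value of each class label."""
    grouped = {}
    for value, label in zip(values, labels):
        grouped.setdefault(label, []).append(value)
    return {label: mean(vals) for label, vals in grouped.items()}


def classify_by_distance(value, class_means):
    """Pick the class whose mean is nearest to the value.

    For a scalar feature, Euclidean distance reduces to |value - mean|.
    """
    return min(class_means, key=lambda label: abs(value - class_means[label]))


def classify_by_threshold(value, threshold):
    """Threshold rule.

    Assumption (not from the source): values above the threshold
    are labelled nasalized, the rest oral.
    """
    return "nasalized" if value > threshold else "oral"


# Hypothetical RMGD values for labelled training vowels (made-up numbers).
train_values = [0.20, 0.30, 0.25, 0.80, 0.90, 0.85]
train_labels = ["oral", "oral", "oral", "nasalized", "nasalized", "nasalized"]

class_means = fit_class_means(train_values, train_labels)

# One possible threshold choice (an assumption): midpoint of the class means.
threshold = mean(class_means.values())

for rmgd in (0.10, 0.30, 0.70):
    print(rmgd,
          classify_by_threshold(rmgd, threshold),
          classify_by_distance(rmgd, class_means))
```

With a single scalar feature and two classes, the nearest-mean rule and a midpoint threshold give the same decisions. The two rules differ once the threshold is tuned separately or more features are added.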