• DocumentCode
    3109693
  • Title

    Analysis of spurious vowel-like regions (VLRs) detected by excitation source information

  • Author

    Dev Sarma, Biswajit ; Prasanna, S.R.M.

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This work treats vowels and semivowels as vowellike regions. An analysis of the spurious vowel-like regions (VLRs) detected by a signal processing based method using excitation source information is demonstrated. Limitation of excitation information in detecting some of the nasals and voiced consonants as non-VLRs is discussed. An attempt to reduce spurious VLRs compared to the existing signal processing based method for VLRs detection [1] is made. A multi-class statistical phone classifier that classifies speech into broad vowel, consonant and silence categories is trained. The outputs of the classifier are suitably combined to get evidence for vowel-like regions, different broad categories of consonants and silence regions. The output from the existing signal processing method is compared with different evidences from the statistical method. The spurious ones are eliminated by using the evidences from the statistical method. The experimental studies conducted on TIMIT and inhouse databases demonstrate significant reduction in the spurious VLRs with a little loss in the VLRs detection performance. A net gain of 4.21% and 7.71% in frame error rate is achieved for TIMIT and in-house databases, respectively.
  • Keywords
    database management systems; signal classification; speech processing; statistical analysis; TIMIT; VLR detection performance; broad vowel; consonant category; excitation source information; frame error rate; in-house databases; inhouse databases; multiclass statistical phone classifier; nasals; semivowels; signal processing based method; silence category; speech classification; spurious VLR; spurious vowel-like regions; statistical method; voiced consonants; Databases; Hidden Markov models; Speech; Speech processing; Statistical analysis; Testing; signal processing method; spurious VLRs; statistical method; vowel-like regions (VLRs);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2013 Annual IEEE
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4799-2274-1
  • Type

    conf

  • DOI
    10.1109/INDCON.2013.6725965
  • Filename
    6725965