• DocumentCode
    3178442
  • Title
    Impact of pronunciation variation in speech recognition
  • Author
    Brunet, R. Golda ; Murthy, Hema A.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
  • fYear
    2012
  • fDate
    22-25 July 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Mapping the acoustic sequence to lexical units is an issue in speech recognition. To address this, multiple pronunciations are included in the pronunciation dictionary. However, the number of lexical variants required for improved recognition is not clear, as pronunciation varies significantly across dialects, and this can sometimes lead to poor recognition. In this paper, a systematic study is carried out to observe the effect of pronunciation variation on recognition accuracy. In particular, a data-driven approach is employed to observe pronunciation variation at the syllable level. Acoustic cues about the syllable boundaries are obtained from Group Delay (GD) segmentation. Preliminary experiments carried out on the TIMIT corpus reveal that the use of prominent pronunciation variants for each dialect leads to improved recognition performance.
  • Keywords
    dictionaries; speech recognition; acoustic sequence; data-driven approach; dictionary; group delay segmentation; lexical units; pronunciation variation; speech recognition; syllable boundaries; Acoustics; Delay; Dictionaries; NIST; Software; Speech; Speech recognition; Pronunciation Dictionary; Pronunciation Variation; Speech Recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Communications (SPCOM), 2012 International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4673-2013-9
  • Type
    conf
  • DOI
    10.1109/SPCOM.2012.6290037
  • Filename
    6290037