Title :
Impact of pronunciation variation in speech recognition
Author :
Brunet, R. Golda ; Murthy, Hema A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
Abstract :
Mapping an acoustic sequence to lexical units is a challenge in speech recognition. To address it, multiple pronunciations are included in the pronunciation dictionary. However, because pronunciation varies significantly across dialects, it is not clear how many lexical variants are required for improved recognition, and this can sometimes lead to poor recognition. In this paper, a systematic study is carried out to observe the effect of pronunciation variation on recognition accuracy. In particular, a data-driven approach is employed to observe pronunciation variation at the syllable level. Acoustic cues to the syllable boundaries are obtained from group delay (GD) segmentation. Preliminary experiments on the TIMIT corpus reveal that using the prominent pronunciation variants for each dialect leads to improved recognition performance.
Keywords :
dictionaries; speech recognition; acoustic sequence; data-driven approach; dictionary; group delay segmentation; lexical units; pronunciation variation; speech recognition; syllable boundaries; Acoustics; Delay; Dictionaries; NIST; Software; Speech; Speech recognition; Pronunciation Dictionary; Pronunciation Variation; Speech Recognition;
Conference_Title :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
DOI :
10.1109/SPCOM.2012.6290037