DocumentCode
3178442
Title
Impact of pronunciation variation in speech recognition
Author
Brunet, R. Golda ; Murthy, Hema A.
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear
2012
fDate
22-25 July 2012
Firstpage
1
Lastpage
5
Abstract
Mapping an acoustic sequence to lexical units is an issue in speech recognition. To address it, multiple pronunciations are included in the pronunciation dictionary. However, the number of lexical variants required for improved recognition is not clear, because pronunciation varies significantly across dialects, and this uncertainty can sometimes lead to poor recognition. In this paper, a systematic study is carried out to observe the effect of pronunciation variation on recognition accuracy. In particular, a data-driven approach is employed to observe pronunciation variation at the syllable level. Acoustic cues to the syllable boundaries are obtained from Group Delay (GD) segmentation. Preliminary experiments on the TIMIT corpus reveal that using the prominent pronunciation variants for each dialect leads to improved recognition performance.
Keywords
dictionaries; speech recognition; acoustic sequence; data-driven approach; dictionary; group delay segmentation; lexical units; pronunciation variation; speech recognition; syllable boundaries; Acoustics; Delay; Dictionaries; NIST; Software; Speech; Speech recognition; Pronunciation Dictionary; Pronunciation Variation; Speech Recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-4673-2013-9
Type
conf
DOI
10.1109/SPCOM.2012.6290037
Filename
6290037