Title :
Pronunciation variation across different dialects for English: A syllable-centric approach
Author :
Brunet, R. Golda ; Murthy, Hema A.
Author_Institution :
Dept. of Comput. Sci. & Eng., IIT Madras, Chennai, India
Abstract :
Automatic Speech Recognition (ASR) systems account for wide variability in the acoustic signal through large amounts of training data. From a linguistic point of view, the acoustic variability is a consequence of pronunciation variation. It is apparent that neither (i) any two speakers utter the same words exactly the same way nor (ii) an individual can repeat the same words with acoustic identity. Hence ASR systems usually rely on multiple-pronunciation lexicons to match an acoustic sequence with a lexical unit. In this study, we have adopted a data-driven approach to generate pronunciation variants at syllable level. Group-Delay (GD) segmentation algorithm is used to acquire acoustic cue about syllable boundaries, which are validated by a vowel-onset point (VOP) detection algorithm. Manual transcriptions of GD syllable segments are done to produce new pronunciation variants. Results on the TIMIT database show that some pronunciations are exclusive for a particular dialect.
Keywords :
acoustic signal processing; natural language processing; speech recognition; English; acoustic cue; acoustic sequence; acoustic signal; automatic speech recognition; dialects; group-delay segmentation algorithm; lexical unit; multiple pronunciation lexicon; pronunciation variation; syllable boundary; syllable level; syllable-centric approach; vowel-onset point detection algorithm; Acoustics; Detection algorithms; Manuals; Production; Speech; Speech processing; Speech recognition;
Conference_Titel :
Communications (NCC), 2012 National Conference on
Conference_Location :
Kharagpur
Print_ISBN :
978-1-4673-0815-1
DOI :
10.1109/NCC.2012.6176740