DocumentCode :
2110827
Title :
Naxi-accented Mandarin speech recognition based on pronunciation dictionary adaptation
Author :
Chen Jiang ; Yang Jian ; Xu Yonghua
Author_Institution :
Sch. of Inf. Sci. & Eng., Yunnan Univ., Kunming, China
fYear :
2010
fDate :
29-31 July 2010
Firstpage :
2685
Lastpage :
2689
Abstract :
This paper primarily concentrates on Mandarin LVCSR (Large Vocabulary Continuous Speech Recognition) for nonnative speakers from Yunnan, China. With an available standard Mandarin speech recognizer, we attempt to establish an accent-specific speech recognizer for the speakers whose native language is Naxi language, based on the Initial-Final structure of the Chinese language, in combination with the variation regularity of pronunciation in Naxi-accented speech. In order to obtain the variation regularity of the initials, finals and syllables in Naxi-accented speech, we analyze the pronunciation variability of the non-native speech using the data-driven approach supervised by expert knowledge. A novel method to automatically construct multi-pronunciation dictionary of the given accent which can be easily extended to the other linguistic minorities´ accents was proposed. Experimental results show that the Naxi-accented speech recognition rates with the multi-pronunciation dictionary constructed by the proposed method are higher than the rates with single pronunciation dictionary, after imported bi-gram language model.
Keywords :
dictionaries; natural language processing; speaker recognition; Chinese language; Mandarin LVCSR; Naxi language; Naxi-accented mandarin speech recognition; Yunnan; accent-specific speech recognizer; bi-gram language model; data-driven approach; initial-final structure; large vocabulary continuous speech recognition; multipronunciation dictionary adaptation; nonnative speech; pronunciation variability analysis; Acoustics; Adaptation model; Dictionaries; Hidden Markov models; Markov processes; Speech; Speech recognition; Naxi-Accented Mandarin; Pronunciation Dictionary Adaptation; Speaker Adaptation; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2010 29th Chinese
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6263-6
Type :
conf
Filename :
5573555
Link To Document :
بازگشت