Title :
HMM-based phonemic distance in different speaking styles and its influence on substitutions in Mandarin speech recognition
Author :
Yang, Zhanlei ; Liu, Wenju ; Lv, Zhenyu
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
Abstract :
Statistical confusability between different acoustic models is important to character substitution error rate in large vocabulary continuous speech recognition. In this paper, we take factors of gender and speaking styles into consideration in Mandarin speech recognition. We modeled phonemes in different speaking styles, including read speech of female, male, and spontaneous dialogue. Then minimum gaussian distances between Chinese Initial/Final model pairs are given and average phoneme distances are calculated which denote the pronunciation varieties. The effect of different style to average phonemic distance is studied and relative articulation is given for three databases. Qualitative relationship between phone size and error rate in recognition is analytical researched, showing that for a particular phoneme, pronunciation variety is one of reasons for misidentification in recognizing process, which provides us a novel mind to reduce substitution errors.
Keywords :
Gaussian processes; hidden Markov models; speech processing; speech recognition; Chinese initial-final model pair; HMM based phonemic distance; Mandarin speech recognition; acoustic model; character substitution error rate; gender factor; large vocabulary continuous speech recognition; minimum gaussian distances; phone size; speaking style; substitution errors reduction; Acoustic applications; Automation; Databases; Decoding; Error analysis; Hidden Markov models; Loudspeakers; Performance analysis; Speech recognition; Vocabulary; articulation; error rate; phonemic distance; pronunciation variety;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-4538-7
Electronic_ISBN :
978-1-4244-4540-0
DOI :
10.1109/NLPKE.2009.5313752