DocumentCode :
1590006
Title :
Real Context Model for Tone Recognition in Mandarin Conversational Telephone Speech
Author :
Qingwei Zhao ; Jian Shao ; Pengyuan Zhang ; Yonghong Yan ; Ji Feng
Author_Institution :
Chinese Acad. of Sci., Beijing
Volume :
2
fYear :
2007
Firstpage :
696
Lastpage :
699
Abstract :
This paper presents an approach to tone recognition in mandarin conversational telephone speech (CTS) based on a real context model. The real context model is proposed as a new concept designed with special consideration on the fact that mandarin CTS is characterized by complicated tone behaviors due to physiological articulation. As pitch is a supra-segmental feature, current tone´s pitch value is influenced by its context especially in CTS for its fast speaking rate. A real context model covers not only the current tone but also the relative pitch level of pre-tone. Then we cluster the real context annotated training data into a few subsets to generate a more refined tone model. Gaussian Mixture Model (GMM) is used for the tone modeling. In addition, a kind of similarity measurements to compute the distance between two tones is employed, which should reveal the similarity of their pitch contour shapes and also include their different pitch height. All experiments are based on the mandarin CTS database, Train04. Our methods can improve tone recognition accuracy 4.7%.
Keywords :
Gaussian processes; natural languages; speech recognition; GMM; Gaussian mixture model; Mandarin conversational telephone speech; complicated tone behaviors; physiological articulation; pitch contour shapes; real context annotated training data; real context model; relative pitch level; tone recognition; Automatic speech recognition; Context modeling; Hidden Markov models; Pattern recognition; Physics; Shape measurement; Speech recognition; Support vector machine classification; Support vector machines; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation, 2007. ICNC 2007. Third International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2875-5
Type :
conf
DOI :
10.1109/ICNC.2007.595
Filename :
4344440
Link To Document :
بازگشت