Title :
Tone recognition for continuous accented Mandarin Chinese
Author :
Jiang Wu ; Zahorian, Stephen A. ; Hongbing Hu
Author_Institution :
Dept. of Electr. & Comput. Eng., Binghamton Univ., Binghamton, NY, USA
Abstract :
In this paper, the ability of human listeners to recognize tones from continuous Mandarin Chinese is evaluated and compared to the accuracy of automatic systems for tone classification and recognition. All tones used for experimentation were extracted from the RASC863 continuous Mandarin Chinese database. The human listeners are native speakers of Mandarin and the automatic methods consist of tone classification using neural networks and tone recognition using Hidden Markov Models. Features used for the automatic methods are a combination of spectral/temporal features, energy contours, and pitch contours. When very little context is used (i.e., vowel segments only) the human and machine performance is comparable. However, as the context interval is increased, the human performance is much better than the best machine performance obtained.
Keywords :
hidden Markov models; neural nets; speaker recognition; RASC863 continuous Mandarin Chinese database; automatic systems; continuous accented Mandarin Chinese; energy contours; hidden Markov models; human listeners; native speakers; neural networks; pitch contours; spectral-temporal features; tone classification; tone recognition; vowel segments; Abstracts; Acoustics; Indexes; Radio frequency; HMMs; continuous Mandarin Chinese; human listeners; neural networks; tone recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639056