Title :
Tone recognition of continuous Mandarin speech based on neural networks
Author :
Chen, Sin-Homg ; Wang, Yih-Ru
Author_Institution :
Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fDate :
3/1/1995 12:00:00 AM
Abstract :
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate For the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved
Keywords :
hidden Markov models; neural nets; speech recognition; HCNN; HSMLP; basic MLP tone recognizer; best tone sequence; coarticulation effect; continuous Mandarin speech; hidden Markov chain; input utterance; intonation pattern; neighboring syllables; neural networks; processing syllable; recognition features; sandhi rules; tone pronunciation; tone recognition schemes; Data mining; Feature extraction; Frequency; Hidden Markov models; Multilayer perceptrons; Natural languages; Neural networks; Pattern recognition; Shape; Speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on