مرکز منطقه ای اطلاع رساني علوم و فناوري - Tone recognition of continuous Mandarin speech based on neural networks

DocumentCode :

1246538

Title :

Tone recognition of continuous Mandarin speech based on neural networks

Author :

Chen, Sin-Homg ; Wang, Yih-Ru

Author_Institution :

Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan

Volume :

Issue :

fYear :

1995

fDate :

3/1/1995 12:00:00 AM

Firstpage :

146

Lastpage :

150

Abstract :

Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate For the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved

Keywords :

hidden Markov models; neural nets; speech recognition; HCNN; HSMLP; basic MLP tone recognizer; best tone sequence; coarticulation effect; continuous Mandarin speech; hidden Markov chain; input utterance; intonation pattern; neighboring syllables; neural networks; processing syllable; recognition features; sandhi rules; tone pronunciation; tone recognition schemes; Data mining; Feature extraction; Frequency; Hidden Markov models; Multilayer perceptrons; Natural languages; Neural networks; Pattern recognition; Shape; Speech recognition;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.366544

Filename :

366544

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1246538