Title :
Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
Author :
Hirose, Keikichi ; Iwano, Koji
Author_Institution :
Sch. of Frontier Sci., Tokyo Univ., Japan
Abstract :
We have been developing a reliable method of prosodic word boundary detection for Japanese continuous speech based on the statistical modeling of mora transitions of fundamental frequency contours of prosodic words. Modifications in the codebook sizes and in the HMM topologies improved the boundary detection performance. When using mora boundary information obtainable from the phoneme recognition process, the detection rates were reached around 73% with 12.5% insertion errors for speaker-open experiments. This method was then integrated to a continuous speech recognition system with unlimited vocabulary. The integrated system conducts the recognition process in two stages: the first stage is to detect mora boundaries without prosodic information and the second stage is to increase the mora recognition rate using prosodic word boundary information. Slight improvements in mora recognition rates were observed both in speaker-closed and -open experiments
Keywords :
hidden Markov models; natural languages; speech recognition; HMM topologies; Japanese continuous speech; boundary detection performance; codebook sizes; continuous speech recognition; detection rates; fundamental frequency contours; mora transitions; phoneme recognition process; prosodic word boundaries; speaker-closed experiments; speaker-open experiments; statistical modeling; Frequency; Hidden Markov models; Humans; Informatics; Length measurement; Reliability engineering; Speech processing; Speech recognition; Topology; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.862094