Title :
Prosodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition
Author :
Iwano, Koji ; Hirose, Keikichi
Author_Institution :
Dept. of Inf. & Commun. Eng., Tokyo Univ., Japan
Abstract :
A new method for prosodic word boundary detection in continuous speech was developed based on the statistical modeling of moraic transitions of fundamental frequency (F0) contours, formerly proposed by the authors. In the developed method, F0 contours of prosodic words were modeled separately according to the accent types. An input utterance was matched against the models and was divided into constituent prosodic words. By doing so, prosodic word boundaries can be obtained. The method was first applied to the boundary detection experiments of the ATR continuous speech corpus. With mora boundary locations given in the corpus, total detection rate reached 91.5%. Then the method was integrated into a continuous speech recognition scheme with unlimited vocabulary. A few percentage improvement was observed in mora recognition for the above corpus. Although all the experiments were done in closed conditions due to the corpus availability, the results indicated the usefulness of the proposed method
Keywords :
speech processing; speech recognition; statistical analysis; ATR continuous speech corpus; accent types; continuous speech recognition; detection rate; fundamental frequency contours; input utterance; mora recognition; moraic fundamental frequency contours; moraic transitions; prosodic word boundary detection; statistical modeling; Event detection; Frequency; Humans; Impedance matching; Speech processing; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758080