DocumentCode
3059719
Title
Development of Japanese voice-activated word processor using isolated monosyllable recognition
Author
Nitta, T. ; Murata, T. ; Tsuboi, H. ; Takeda, K. ; Kawada, T. ; Watanabe, S.
Author_Institution
Toshiba Research and Development Center, Kawasaki, Japan
Volume
7
fYear
1982
fDate
30072
Firstpage
871
Lastpage
874
Abstract
This paper describes a newly developed voice-activated word processor and a two-stage recognition method to achieve a precise recognition of isolated monosyllables. At the first stage, the recognizer segments a monosyllable into an initial consonantal part and a final part (i.e., the vowel region), and computes similarities between the input speech and orthonormal mode functions of each consonantal segment which is designed from multiple speakers using K-L expansion and adapted to a new speaker ( Adaptive Multiple Similarity Method). At the second stage, frame-by-frame similarity scores, extracted at the phoneme recognizer using Multiple Similarity Method, are applied to candidate monosyllables to make a final decision. The average monosyllable recognition accuracy with six speakers was about 95%.
Keywords
Commercialization; Concatenated codes; Equations; Frequency; Natural languages; Pattern matching; Pattern recognition; Prototypes; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type
conf
DOI
10.1109/ICASSP.1982.1171875
Filename
1171875
Link To Document