Development of Japanese voice-activated word processor using isolated monosyllable recognition

Author

Nitta, T. ; Murata, T. ; Tsuboi, H. ; Takeda, K. ; Kawada, T. ; Watanabe, S.

Author_Institution

Toshiba Research and Development Center, Kawasaki, Japan

Volume

fYear

1982

fDate

May 1982

Firstpage

871

Lastpage

874

Abstract

This paper describes a newly developed voice-activated word processor and a two-stage recognition method for accurate recognition of isolated monosyllables. In the first stage, the recognizer segments a monosyllable into an initial consonantal part and a final part (i.e., the vowel region), and computes similarities between the input speech and the orthonormal mode functions of each consonantal segment, which are designed from multiple speakers using K-L expansion and adapted to a new speaker (Adaptive Multiple Similarity Method). In the second stage, frame-by-frame similarity scores, extracted by the phoneme recognizer using the Multiple Similarity Method, are applied to the candidate monosyllables to make a final decision. The average monosyllable recognition accuracy for six speakers was about 95%.
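The similarity computation named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common unweighted form of a multiple-similarity score (the sum of squared projections of the input onto orthonormal mode functions, divided by the input energy), and it derives the mode functions by a K-L expansion of the training autocorrelation matrix using power iteration. The function names (`kl_basis`, `multiple_similarity`), the toy 3-dimensional feature vectors, and the number of modes are all hypothetical; the speaker adaptation and the second-stage decision described in the abstract are not modeled.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def kl_basis(samples, n_modes, iters=200):
    """Return up to n_modes orthonormal mode functions (K-L expansion).

    They are the leading eigenvectors of the autocorrelation matrix of the
    training samples, found by power iteration with deflation.
    """
    dim = len(samples[0])
    n = len(samples)
    # Autocorrelation matrix R = (1/N) * sum of x x^T over the samples.
    R = [[sum(s[i] * s[j] for s in samples) / n for j in range(dim)]
         for i in range(dim)]
    basis = []
    for _mode in range(n_modes):
        v = [1.0 + 0.1 * i for i in range(dim)]  # fixed, arbitrary start
        found = True
        for _step in range(iters):
            w = [dot(row, v) for row in R]
            # Deflation: remove components along modes already found.
            for b in basis:
                c = dot(w, b)
                w = [wi - c * bi for wi, bi in zip(w, b)]
            norm = math.sqrt(dot(w, w))
            if norm < 1e-12:  # no remaining variance in this subspace
                found = False
                break
            v = [wi / norm for wi in w]
        if not found:
            break
        basis.append(v)
    return basis


def multiple_similarity(x, basis):
    """Unweighted multiple similarity of x to the subspace spanned by basis.

    Assumption: each mode is weighted equally; the score lies in [0, 1].
    """
    energy = dot(x, x)
    if energy == 0.0:
        return 0.0
    return sum(dot(x, phi) ** 2 for phi in basis) / energy


# Toy training data for two hypothetical consonant categories.
class_a = [[1.0, 0.1, 0.0], [0.9, -0.1, 0.0], [1.1, 0.0, 0.05]]
class_b = [[0.0, 0.1, 1.0], [0.05, -0.1, 0.9], [0.0, 0.0, 1.1]]
modes_a = kl_basis(class_a, n_modes=1)
modes_b = kl_basis(class_b, n_modes=1)

x = [0.8, 0.05, 0.0]  # input feature vector to classify
scores = {"a": multiple_similarity(x, modes_a),
          "b": multiple_similarity(x, modes_b)}
best = max(scores, key=scores.get)  # category with the highest similarity
```

In this sketch the category whose mode functions capture the most input energy is chosen. A weighted variant, in which each squared projection is scaled by its eigenvalue, would need `kl_basis` to return the eigenvalues as well.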

Keywords

Commercialization; Concatenated codes; Equations; Frequency; Natural languages; Pattern matching; Pattern recognition; Prototypes; Speech processing; Speech recognition;

fLanguage

English

Publisher

IEEE

Conference_Title

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.

Type

conf

DOI

10.1109/ICASSP.1982.1171875

Filename

1171875

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3059719