DocumentCode
3522347
Title
Phoneme environment clustering for speech recognition
Author
Sagayama, Shigeki
Author_Institution
Human Interface Lab., NTT, Tokyo, Japan
fYear
1989
fDate
23-26 May 1989
Firstpage
397
Abstract
A general principle is proposed to solve problems in context-dependent phoneme segment (or subword unit) based speech recognition, namely, how to choose the set of units and how to estimate the context effect missing in the training data. A phoneme environment clustering algorithm, which automatically selects an optimal set of allophones and estimates the missing context, is presented. This algorithm additionally gives the means to analyze coarticulation effects automatically and quantitatively. The problem is formulated as a clustering technique in phoneme environment space to approximate the mapping function from phoneme environment space to phoneme pattern space by a limited number of centroid patterns based on a distortion measure defined on the phoneme pattern space. The algorithm is tested for phoneme recognition and word recognition, and results are discussed
Keywords
speech recognition; allophones; centroid patterns; coarticulation effects; context-dependent phoneme segment; distortion measure; mapping function; phoneme environment clustering algorithm; phoneme environment space; phoneme pattern space; phoneme recognition; speech recognition; subword unit; word recognition; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Environmental factors; Extraterrestrial measurements; Humans; Laboratories; Speech analysis; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266449
Filename
266449
Link To Document