DocumentCode :
336739
Title :
Automatic clustering and generation of contextual questions for tied states in hidden Markov models
Author :
Singh, R. ; Raj, B. ; Stern, Richard M.
Author_Institution :
Sch. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
117
Abstract :
Most current automatic speech recognition systems based on HMMs cluster or tie together subsets of the subword units with which speech is represented. This tying improves the recognition accuracy when systems are trained with limited data, and is performed by classifying the sub-phonetic units using a series of binary tests based on speech production, called “linguistic questions”. This paper describes a new method for automatically determining the best combinations of subword units to form these questions. The hybrid algorithm proposed clusters state distributions of context-independent phones to obtain questions for triphonetic contexts. Experiments confirm that the questions thus generated can replace manually generated questions and can provide improved recognition accuracy. Automatic generation of questions has the additional important advantage of extensibility to languages for which the phonetic structure is not well understood by the system designer, and can be effectively used in situations where the subword units are not phonetically motivated
Keywords :
hidden Markov models; pattern classification; pattern clustering; speech recognition; statistical analysis; HMM; automatic clustering; automatic generation; binary tests; context-independent phones; contextual questions; experiments; hidden Markov models; hybrid algorithm; languages; linguistic questions; pattern classification; phonetic structure; recognition accuracy; speech production; state distributions; sub-phonetic units classification; subword units; tied states; triphonetic contexts; Automatic speech recognition; Clustering algorithms; Computer science; Hidden Markov models; Humans; Maximum likelihood estimation; Performance evaluation; Production systems; Speech recognition; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758076
Filename :
758076
Link To Document :
بازگشت