DocumentCode :
2416745
Title :
Rapid unsupervised adaptation using context independent phoneme model
Author :
Kobashikawa, S. ; Ogawa, Anna ; Yamaguchi, Yoshio ; Takahashi, Satoshi
Author_Institution :
NTT Cyber Space Labs., NTT Corp., Yokosuka, Japan
fYear :
2009
fDate :
25-28 May 2009
Firstpage :
209
Lastpage :
212
Abstract :
Users require rapid and highly accurate speech recognition systems. Accuracy could be improved by unsupervised adaptation as provided by CMLLR (Constrained Maximum Likelihood Linear Regression). CMLLR-based batch-type unsupervised adaptation estimates a single global transformation matrix by utilizing unsupervised labeling; unfortunately, it needs prior labeling and so is not rapid. Our proposed technique reduces the prior labeling time by using context independent phoneme models (monophones) and frame-by-frame statistics accumulation in unsupervised adaptation. The proposed technique further raises the accuracy by accumulating statistics with power and performing recognition with power after adaptation. Simulations using spontaneous speech show that the proposed technique reduced the total computational time of labeling and recognition by 52.2% while matching the recognition rate of the conventional unsupervised adaptation technique that uses context dependent phoneme models (triphones) statistics accumulation.
Keywords :
speech enhancement; speech recognition; statistics; context independent phoneme model; frame-by-frame statistics accumulation; monophones; rapid unsupervised adaptation; speech recognition systems; spontaneous speech; triphones; Acoustic beams; Automatic speech recognition; Computational modeling; Consumer electronics; Context modeling; Labeling; Laboratories; Maximum likelihood linear regression; Speech recognition; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics, 2009. ISCE '09. IEEE 13th International Symposium on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-2975-2
Electronic_ISBN :
978-1-4244-2976-9
Type :
conf
DOI :
10.1109/ISCE.2009.5157000
Filename :
5157000
Link To Document :
بازگشت