Title :
Speech conversion from clean conditions to telephone ones
Author :
Zuo, Guoyu ; Liu, Wenju ; Ruan, Xiaogang
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
Abstract :
This paper addresses the application in speech recognition of simulated telephone speech, which is generated from clean speech by approximately mimicking actual telephone channel conditions. Maximum Likelihood Linear Regression (MLLR) algorithm was performed to conduct experiments on evaluating the performances of HMM recognizers, which were trained from clean speech and from generated telephone data, respectively. The test and adaptation data were recorded by piping clean speech through local telephone network. The experiments without adaptation report that the simulation models trained on generated data can give an obviously higher rate than the clean speech. The adaptation performances show that MLLR lends itself to further improve the recognition performance of telephone recognition system. The results show that telephone speech recognition performance can be effectively improved using the generated data, and its generating method can reduce the acoustic mismatch between training and testing data that was induced by the shortage of actual telephone speech.
Keywords :
hidden Markov models; maximum likelihood estimation; regression analysis; speech recognition; telephone networks; HMM recognizers; adaptation data performance; clean conditions; hidden Markov models; local telephone network; maximum likelihood linear regression algorithm; simulated telephone speech; speech conversion; speech recognition; speech testing data; speech training data; telephone channel conditions; telephone data; telephone recognition system; Acoustic testing; Automation; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood linear regression; Pattern recognition; Performance evaluation; Speech recognition; Telephony;
Conference_Titel :
Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on
Print_ISBN :
0-7803-8273-0
DOI :
10.1109/WCICA.2004.1342303