DocumentCode :
542290
Title :
Analysis on individual differences in automatic transcription of spontaneous presentations
Author :
Shinozaki, Takahiro ; Furui, Sadaoki
Author_Institution :
Tokyo Institute of Technology, Department of Computer Science, 2-12-1 Ookayama, Meguro-ku, 152-8552 Japan
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper reports an analysis of individual differences in spontaneous presentation speech recognition performances. Ten minutes from each presentation given by 50 male speakers, for a total of 500 minutes, has been automatically recognized for the analysis. Correlation and regression analyses were applied to the word recognition accuracy and various speaker attributes. A restricted set of the speaker attributes comprising the speaking rate, the out of vocabulary rate and the repair rate was found to be most significant to yield individual differences in the word accuracy. Unsupervised MLLR speaker adaptation worked well for improving the word accuracy but did not change the structure of the individual differences. Approximately half of the variance in the word accuracy was explained by a regression model using the limited set of three attributes.
Keywords :
Accuracy; Adaptation model; Hidden Markov models; Pragmatics; Silicon; Strontium; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743821
Filename :
5743821
Link To Document :
بازگشت