DocumentCode :
3529531
Title :
Unsupervised speaker adaptation for telephone call transcription
Author :
Wallace, R. ; Thambiratnam, K. ; Seide, F.
Author_Institution :
Speech & Audio Res. Lab., Queensland Univ. of Technol., Brisbane, QLD
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4393
Lastpage :
4396
Abstract :
The use of the PC and Internet for placing telephone calls will present new opportunities to capture vast amounts of un-transcribed speech for a particular speaker. This paper investigates how to best exploit this data for speaker-dependent speech recognition. Supervised and unsupervised experiments in acoustic model and language model adaptation are presented. Using one hour of automatically transcribed speech per speaker with a word error rate of 36.0%, unsupervised adaptation resulted in an absolute gain of 6.3%, equivalent to 70% of the gain from the supervised case, with additional adaptation data likely to yield further improvements. LM adaptation experiments suggested that although there seems to be a small degree of speaker idiolect, adaptation to the speaker alone, without considering the topic of the conversation, is in itself unlikely to improve transcription accuracy.
Keywords :
Internet telephony; acoustic signal processing; natural languages; speaker recognition; unsupervised learning; Internet; acoustic model adaptation; automatic speech transcription; language model adaptation; speaker-dependent speech recognition; telephone call transcription; unsupervised speaker adaptation; unsupervised training; Acoustic devices; Adaptation model; Australia; Automatic speech recognition; Error analysis; Internet telephony; Laboratories; Loudspeakers; Natural languages; Speech recognition; Speaker adaptation; acoustic model adaptation; language model adaptation; speech recognition; unsupervised adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960603
Filename :
4960603
Link To Document :
بازگشت