DocumentCode
3529531
Title
Unsupervised speaker adaptation for telephone call transcription
Author
Wallace, R. ; Thambiratnam, K. ; Seide, F.
Author_Institution
Speech & Audio Res. Lab., Queensland Univ. of Technol., Brisbane, QLD
fYear
2009
fDate
19-24 April 2009
Firstpage
4393
Lastpage
4396
Abstract
The use of the PC and Internet for placing telephone calls will present new opportunities to capture vast amounts of un-transcribed speech for a particular speaker. This paper investigates how to best exploit this data for speaker-dependent speech recognition. Supervised and unsupervised experiments in acoustic model and language model adaptation are presented. Using one hour of automatically transcribed speech per speaker with a word error rate of 36.0%, unsupervised adaptation resulted in an absolute gain of 6.3%, equivalent to 70% of the gain from the supervised case, with additional adaptation data likely to yield further improvements. LM adaptation experiments suggested that although there seems to be a small degree of speaker idiolect, adaptation to the speaker alone, without considering the topic of the conversation, is in itself unlikely to improve transcription accuracy.
Keywords
Internet telephony; acoustic signal processing; natural languages; speaker recognition; unsupervised learning; Internet; acoustic model adaptation; automatic speech transcription; language model adaptation; speaker-dependent speech recognition; telephone call transcription; unsupervised speaker adaptation; unsupervised training; Acoustic devices; Adaptation model; Australia; Automatic speech recognition; Error analysis; Internet telephony; Laboratories; Loudspeakers; Natural languages; Speech recognition; Speaker adaptation; acoustic model adaptation; language model adaptation; speech recognition; unsupervised adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960603
Filename
4960603
Link To Document