Title :
Improved estimation of supervision in unsupervised speaker adaptation
Author :
Homma, Shigeru ; Aikawa, Kiyoaki ; Sagayama, Shigeki
Author_Institution :
NTT Human Interface Labs., Kanagawa, Japan
Abstract :
Unsupervised speaker adaptation plays an important role in “batch dictation”, the aim of which is to automatically transcribe large amounts of recorded dictation using speech recognition. In the case of unsupervised speaker adaptation which uses recognition results of target speech as the means of supervision, erroneous recognition results degrade the quality of the adapted acoustic models. This paper presents a new supervision selection method. By using this method, correction of the first candidate is judged based on the likelihood ratio of the first and the second candidates. This method eliminates erroneous recognition results and corresponding speech data from the adaptive training data. We implemented this method in the iterative unsupervised speaker adaptation procedure. It is shown that the recognition errors are drastically reduced by 50% in a practical application of batch-style speech-to-text conversion of recorded dictation of Japanese medical diagnoses compared with speaker-independent recognition
Keywords :
adaptive estimation; dictation; iterative methods; speech recognition; unsupervised learning; Japanese medical diagnoses; adapted acoustic models; adaptive training; batch dictation; batch-style speech-to-text conversion; erroneous recognition results; iterative unsupervised speaker adaptation procedure; likelihood ratio; recorded dictation; speech recognition; supervision; target speech; transcription; unsupervised speaker adaptation; Automatic speech recognition; Degradation; Hidden Markov models; Humans; Iterative methods; Laboratories; Loudspeakers; Speech recognition; Target recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596114