Title :
A comparative study of model-based adaptation techniques for a compact speech recognizer
Author :
Thiele, Frank ; Bippus, Rolf
Author_Institution :
Philips Res. Lab., Aachen, Germany
Abstract :
Many techniques for speaker adaptation have been successfully applied to automatic speech recognition. This paper compares the performance of several adaptation methods with respect to their memory need and processing demand. For adaptation of a compact acoustic model with 4k densities, eigenvoices and structural MAP (SMAP) are investigated next to the well-known techniques of MAP (maximum a posteriori) and MLLR (maximum likelihood linear regression) adaptation. Experimental results are reported for unsupervised on-line adaptation on different amounts of adaptation data ranging from 4 to 500 words per speaker. The results show that for small amounts of adaptation data it might be more efficient to employ a larger baseline acoustic model without adaptation. Eigenvoices achieve the lowest word error rates of all adaptation techniques but SMAP presents a good compromise between memory requirement and accuracy.
Keywords :
acoustic signal processing; eigenvalues and eigenfunctions; learning (artificial intelligence); maximum likelihood estimation; speech recognition; acoustic model; automatic speech recognition; compact speech recognizer; eigenvoices; maximum likelihood linear regression; model-based adaptation techniques; processing demand; speaker adaptation; structural MAP; training; unsupervised adaptation; Adaptation model; Automatic speech recognition; Command and control systems; Degradation; Error analysis; Laboratories; Loudspeakers; Maximum likelihood linear regression; Regression tree analysis; Speech recognition;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034581