Title :
Speaker adaptation with all-pass transforms
Author :
McDonough, John ; Byrne, William
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
In previous work, a class of transforms were proposed which achieve a remapping of the frequency axis much like conventional vocal tract length normalization. These mappings, known collectively as all-pass transforms (APT), were shown to produce substantial improvements in the performance of a large vocabulary speech recognition system when used to normalize incoming speech prior to recognition. In this application, the most advantageous characteristic of the APT was its cepstral-domain linearity; this linearity makes speaker normalization simple to implement, and provides for the robust estimation of the parameters characterizing individual speakers. In the current work, we exploit the APT to develop a speaker adaptation scheme in which the cepstral means of a speech recognition model are transformed to better match the speech of a given speaker. In a set of speech recognition experiments conducted on the Switchboard corpus, we report reductions in word error rate of 3.7% absolute
Keywords :
adaptive signal processing; cepstral analysis; hidden Markov models; parameter estimation; speech processing; speech recognition; transforms; HMM; Switchboard corpus; all-pass transforms; cepstral means; cepstral-domain linearity; frequency axis remapping; hidden Markov model; large vocabulary speech recognition system; performance; robust estimation; speaker adaptation; speaker normalization; speech recognition experiments; speech recognition model; vocal tract length normalization; word error rate reduction; Cepstral analysis; Error analysis; Hidden Markov models; Linearity; Maximum likelihood linear regression; Natural languages; Robustness; Speech processing; Speech recognition; Transforms;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759778