Title :
Phonetically adaptive cepstrum mean normalization for acoustic mismatch compensation
Author :
Morishima, Masatoshi ; Isobe, Toshihiro ; Takahashi, Jun-ichi
Author_Institution :
NTT Data Corp., Kanagawa, Japan
Abstract :
We propose a new technique that compensates for an acoustic mismatch. This technique is simple and can estimate the acoustic mismatch more accurately than conventional cepstrum mean normalization (CMN), because it takes into consideration the kind of phonemes and their frequency, and can calculate the acoustic mismatch in detail. In this procedure the acoustic mismatch can be estimated as the difference between the centroid vector of distorted speech and that of acoustic models. The cepstral mean of distorted speech is the centroid vector including the distortion. The centroid vector calculated from parameters of acoustic models is regarded as the centroid vector when the distorted speech is assumed to be clean speech. The acoustic models used for calculation are for phonemes that appear in the transcription of the speech. This technique achieves a high word error reduction rate of 73% for ordinary analog telephone speech and 70% for wireless telephone handset speech
Keywords :
cepstral analysis; errors; speech recognition; telephony; vectors; acoustic mismatch compensation; acoustic models; analog telephone speech; calculation; centroid vector; clean speech; phonemes; phonetically adaptive cepstrum mean normalization; speech distortion; speech processing; speech recognition; wireless telephone handset speech; word error reduction rate; Acoustic distortion; Background noise; Cepstral analysis; Cepstrum; Frequency estimation; Information technology; Laboratories; Speech enhancement; Speech recognition; Telephony;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659121