DocumentCode :
321485
Title :
Phonetically adaptive cepstrum mean normalization for acoustic mismatch compensation
Author :
Morishima, Masatoshi ; Isobe, Toshihiro ; Takahashi, Jun-ichi
Author_Institution :
NTT Data Corp., Kanagawa, Japan
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
436
Lastpage :
441
Abstract :
We propose a new technique that compensates for an acoustic mismatch. This technique is simple and can estimate the acoustic mismatch more accurately than conventional cepstrum mean normalization (CMN), because it takes into consideration the kind of phonemes and their frequency, and can calculate the acoustic mismatch in detail. In this procedure the acoustic mismatch can be estimated as the difference between the centroid vector of distorted speech and that of acoustic models. The cepstral mean of distorted speech is the centroid vector including the distortion. The centroid vector calculated from parameters of acoustic models is regarded as the centroid vector when the distorted speech is assumed to be clean speech. The acoustic models used for calculation are for phonemes that appear in the transcription of the speech. This technique achieves a high word error reduction rate of 73% for ordinary analog telephone speech and 70% for wireless telephone handset speech
Keywords :
cepstral analysis; errors; speech recognition; telephony; vectors; acoustic mismatch compensation; acoustic models; analog telephone speech; calculation; centroid vector; clean speech; phonemes; phonetically adaptive cepstrum mean normalization; speech distortion; speech processing; speech recognition; wireless telephone handset speech; word error reduction rate; Acoustic distortion; Background noise; Cepstral analysis; Cepstrum; Frequency estimation; Information technology; Laboratories; Speech enhancement; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.659121
Filename :
659121
Link To Document :
بازگشت