Title :
Robust speech recognition by properly utilizing reliable frames and segments in corrupted signals
Author :
Chen, Yi ; Wan, Chia-yu ; Lee, Lin-shan
Author_Institution :
Nat. Taiwan Univ., Taipei
Abstract :
In this paper, we propose a new approach to detecting and utilizing reliable frames and segments in corrupted signals for robust speech recognition. Novel approaches to estimating an energy-based measure and a harmonicity measure for each frame are developed. SNR-dependent GMM classifiers are then trained, together with a reliable frame selection and clustering module and a reliable segment identification module, to detect the most reliable frames in an utterance. These reliable frames and segments thus obtained can be properly used in both front-end feature enhancement and back-end Viterbi decoding. In the extensive experiments reported here, very significant improvements in recognition accuracies were obtained with the proposed approaches for all types of noise and all SNR values defined in the Aurora 2 database.
Keywords :
Gaussian processes; Viterbi decoding; signal detection; speech coding; speech enhancement; speech recognition; statistical analysis; Aurora 2 database; GMM; back-end Viterbi decoding; clustering module; corrupted signal; front-end feature enhancement; reliable frame detection; reliable segment detection; speech recognition; Cepstral analysis; Decoding; Energy measurement; Robustness; Signal processing; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Viterbi algorithm; Harmonic analysis; Viterbi decoding; robustness; speech recognition;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430091