Title :
Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks
Author :
Van Segbroeck, Maarten ; Van hamme, Hugo
Author_Institution :
Dept. ESAT, Katholieke Univ. Leuven, Leuven
fDate :
March 31 2008-April 4 2008
Abstract :
Missing data theory (MDT) has been applied to handle the problem of noise-robust speech recognition. Conventional MDT-systems require acoustic models that are expressed in the log-spectral rather than in the cepstral domain, which leads to a loss in accuracy. Therefore, we have already introduced a MDT-technique that can be applied in any feature domain that is a linear transform of log-spectra. This MDT-system requires hard decisions about the reliability of each spectral component. When computed from noisy data, misclassification errors in the mask are hardly unavoidable and the recognition rate will significantly degrade. The risk of misclassifications can be reduced by estimating a probability that the component is reliable, e.g. a fuzzy mask. In this paper, we extend our MDT-system to be applied in the probabilistic decision framework. Experiments on the Aurora2 database demonstrate a further increase in recognition accuracy, especially at low SNRs.
Keywords :
probability; spectral analysis; speech recognition; transforms; Aurora2 database; MDT system; fuzzy masks; linear transform; missing data theory; probabilistic decision framework; prospect domain; robust speech recognition; spectral component; Additive noise; Cepstral analysis; Databases; Degradation; Detectors; Noise robustness; Speech enhancement; Speech recognition; Statistical analysis; System testing; fuzzy masks; missing data techniques; noise robustness; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518629