DocumentCode :
323769
Title :
Some solution to the missing feature problem in data classification, with application to noise robust ASR
Author :
Morris, Andrew C. ; Cooke, Martin P. ; Green, Phil D.
Author_Institution :
Dept. of Comput. Sci., Sheffield Univ., UK
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
737
Abstract :
We address the theoretical and practical issues involved in automatic speech recognition (ASR) when some of the observation data for the target signal is masked by other signals. Techniques discussed range from simple missing data imputation to Bayesian optimal classification. We have developed the Bayesian approach because this allows prior knowledge to be incorporated naturally into the recognition process, thereby permitting us to go beyond the simple “integrate over missing data” or “marginals” approach reported elsewhere, which we show to be inadequate for dealing with realistic patterns of missing data. After deriving general techniques for recognition with missing data, these techniques are formulated in the context of an HMM based CSR system. This scheme is evaluated under both random and more realistic patterns of missing data, with speech from the DARPA RM corpus and noise from NOISEX. We find that a key problem in real world recognition with missing data is that efficient ASR requires data vector components to be independent, and incomplete data cannot be orthogonalised in the usual way by projection. We show that use of spectral peaks only can provide an effective solution to this problem
Keywords :
Bayes methods; feature extraction; hidden Markov models; noise; optimisation; pattern classification; signal representation; signal resolution; spectral analysis; speech recognition; Bayesian optimal classification; DARPA RM corpus; HMM based CSR system; NOISEX; automatic speech recognition; computational auditory scene analysis; data classification; high resolution spectral data representation; incomplete data; independent data vector components; integrate over missing data; marginals; missing data recognition; missing feature problem solution; noise robust ASR; observation data; orthogonalisation techniques; random patterns; real world recognition; source separation; spectral peaks; target signal; Bayesian methods; Hidden Markov models; Radial basis function networks; Random variables; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675370
Filename :
675370
Link To Document :
بازگشت