• DocumentCode
    3424009
  • Title

    Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks

  • Author

    Van Segbroeck, Maarten ; Van hamme, Hugo

  • Author_Institution
    Dept. ESAT, Katholieke Univ. Leuven, Leuven
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4393
  • Lastpage
    4396
  • Abstract
    Missing data theory (MDT) has been applied to handle the problem of noise-robust speech recognition. Conventional MDT-systems require acoustic models that are expressed in the log-spectral rather than in the cepstral domain, which leads to a loss in accuracy. Therefore, we have already introduced a MDT-technique that can be applied in any feature domain that is a linear transform of log-spectra. This MDT-system requires hard decisions about the reliability of each spectral component. When computed from noisy data, misclassification errors in the mask are hardly unavoidable and the recognition rate will significantly degrade. The risk of misclassifications can be reduced by estimating a probability that the component is reliable, e.g. a fuzzy mask. In this paper, we extend our MDT-system to be applied in the probabilistic decision framework. Experiments on the Aurora2 database demonstrate a further increase in recognition accuracy, especially at low SNRs.
  • Keywords
    probability; spectral analysis; speech recognition; transforms; Aurora2 database; MDT system; fuzzy masks; linear transform; missing data theory; probabilistic decision framework; prospect domain; robust speech recognition; spectral component; Additive noise; Cepstral analysis; Databases; Degradation; Detectors; Noise robustness; Speech enhancement; Speech recognition; Statistical analysis; System testing; fuzzy masks; missing data techniques; noise robustness; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518629
  • Filename
    4518629