• DocumentCode
    312025
  • Title

    Subband-crosscorrelation analysis for robust speech recognition

  • Author

    Kajita, Shoji ; Takeda, Kazuya ; Itakura, Fumitada

  • Author_Institution
    Graduate Sch. of Eng., Nagoya Univ., Japan
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    422
  • Abstract
    This paper describes subband-crosscorrelation (SBXCOR) analysis using two channel signals. The SBXCOR analysis is an extended signal processing technique of subband-autocorrelation (SBCOR) analysis that extracts periodicities present in speech signals. In this paper, the performance of SBXCOR is investigated using a DTW word recognizer, under simulated acoustic conditions on computer and a real environmental condition. Under the simulated condition, it is assumed that speech signals in each channel are perfectly synchronized while noises are not correlated. Consequently, the effective signal-to-noise ratio of the signal generated by simply summing the two signals is raised about 3dB. In such a case, it is shown that SBXCOR is less robust than SBCOR extracted from the two-channel-summing signal, but more robust than the conventional one-channel SBCOR. The resultant performance was much better than that of the smoothed group delay spectrum and mel-frequency cepstral coefficient. In a real computer room, it is shown that SBXCOR is more robust than the two-channel-summed SBCOR
  • Keywords
    cepstral analysis; correlation methods; noise; signal processing; speech recognition; synchronisation; DTW word recognizer; SBCOR; SBXCOR; mel-frequency cepstral coefficient; noise; performance; periodicities; robust speech recognition; signal processing technique; signal-to-noise ratio; simulated acoustic conditions; smoothed group delay spectrum; speech signals; subband-crosscorrelation analysis; synchronization; two-channel-summing signal; Acoustic noise; Acoustic signal processing; Computational modeling; Computer simulation; Noise robustness; Signal analysis; Signal processing; Speech analysis; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607144
  • Filename
    607144