• DocumentCode
    2769498
  • Title

    Speech recognition with localized time-frequency pattern detectors

  • Author

    Schutte, Ken ; Glass, James

  • Author_Institution
    MIT Comput. Sci. & Artificial Intelligence Lab., Cambridge
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    341
  • Lastpage
    346
  • Abstract
    A method for acoustic modeling of speech is presented which is based on learning and detecting the occurrence of localized time-frequency patterns in a spectrogram. A boosting algorithm is applied to both build classifiers and perform feature selection from a large set of features derived by filtering spectrograms. Initial experiments are performed to discriminate digits in the Aurora database. The system succeeds in learning sequences of localized time-frequency patterns which are highly interpretable from an acoustic-phonetic viewpoint. While the work and the results are preliminary, they suggest that pursuing these techniques further could lead to new approaches to acoustic modeling for ASR which are more noise robust and offer better encoding of temporal dynamics than typical features such as frame-based cepstra.
  • Keywords
    acoustic signal processing; speech recognition; time-frequency analysis; acoustic speech modeling; localized time-frequency pattern detectors; spectrogram; speech recognition; Acoustic noise; Acoustic signal detection; Automatic speech recognition; Boosting; Detectors; Filtering algorithms; Spatial databases; Spectrogram; Speech recognition; Time frequency analysis; acoustic modeling; automatic speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430135
  • Filename
    4430135