• DocumentCode
    706130
  • Title

    Speech — Nonspeech discrimination based on speech-relevant spectrogram modulations

  • Author

    Wohlmayr, Michael ; Markaki, Maria ; Stylianou, Yannis

  • Author_Institution
    Comput. Sci. Dept., Univ. of Crete, Heraklion, Greece
  • fYear
    2007
  • fDate
    3-7 Sept. 2007
  • Firstpage
    1551
  • Lastpage
    1555
  • Abstract
    In this work, we adopt an information theoretic approach - the Information Bottleneck method - to extract the relevant modulation frequencies across both dimensions of a spectrogram, for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate the effectiveness of a simple thresholding classifier which is based on the similarity of a sound to each characteristic modulation spectrum. When we assess the performance of the classification system at various SNR conditions using F-measure, results are equally good to a recently proposed method based on the same features but having significantly greater complexity.
  • Keywords
    feature extraction; signal classification; speech processing; F-measure; characteristic modulation spectrum; compact representation; information bottleneck method; information theoretic approach; maximally informative features; modulation frequencies; sound ensemble; spectrogram; speech-non-speech discrimination; thresholding classifier; Complexity theory; Feature extraction; Modulation; Signal to noise ratio; Speech; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2007 15th European
  • Conference_Location
    Poznan
  • Print_ISBN
    978-839-2134-04-6
  • Type

    conf

  • Filename
    7099066