  • DocumentCode
    3403527
  • Title

    Environmental sound recognition using MP-based features

  • Author

    Chu, Selina; Narayanan, Shrikanth; Kuo, C.-C. Jay

  • Author_Institution
    Viterbi Sch. of Eng., Univ. of Southern California, Los Angeles, CA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Defining suitable features for environmental sounds is an important problem in an automatic acoustic scene recognition system. As with most pattern recognition problems, extracting the right feature set is the key to effective performance. A variety of features have been proposed for audio recognition, but the vast majority of past work uses features that are well known for structured data, such as speech and music, and assumes that they will transfer naturally to unstructured sounds. In this paper, we propose a novel method based on matching pursuit (MP) to extract features from environmental sounds. The proposed MP-based method selects features from a dictionary, resulting in a representation that is flexible, yet intuitive and physically interpretable. We show that these features are less sensitive to noise and can effectively represent sounds that originate from different sources and different frequency ranges. The MP-based features can be used to supplement another well-known audio feature, MFCC, to yield higher recognition accuracy for environmental sounds.
  • Keywords
    acoustic signal processing; audio signal processing; feature extraction; iterative methods; MP-based features; audio recognition; automatic acoustic scene recognition system; environmental sound recognition; feature set extraction; matching pursuit; Acoustic noise; Automatic speech recognition; Data mining; Dictionaries; Feature extraction; Layout; Matching pursuit algorithms; Music; Pattern recognition; Speech recognition; Environmental sounds; audio classification; auditory scene recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4517531
  • Filename
    4517531