• DocumentCode
    3604606
  • Title
    Subband Time-Frequency Image Texture Features for Robust Audio Surveillance
  • Author
    Sharan, Roneel V.; Moir, Tom J.
  • Author_Institution
    Sch. of Eng., Auckland Univ. of Technol., Auckland, New Zealand
  • Volume
    10
  • Issue
    12
  • fYear
    2015
  • Firstpage
    2605
  • Lastpage
    2615
  • Abstract
    In this paper, we use time-frequency image representations of sound signals for feature extraction in an audio surveillance application. Starting with conventional spectrogram images, we consider a new feature based on image texture analysis. It uses the gray-level co-occurrence matrix, which captures the distribution of co-occurring values at a given offset. We refer to this as the spectrogram image texture feature. Texture analysis is carried out in subbands and evaluated on a sound database containing ten classes, with each sound class containing multiple subclasses. The proposed feature was found to be more noise robust than two commonly used cepstral features (mel-frequency cepstral coefficients and gammatone cepstral coefficients), the spectrogram image feature (SIF), in which central moments are extracted as features, and a variation of SIF with reduced feature dimension. In addition, we achieved a significant improvement in classification accuracy for the three time-frequency image features by using a gammatone filter-based time-frequency image, referred to as the cochleagram image, for feature extraction instead of the spectrogram image. A combination of cepstral and cochleagram image features also improved classification performance.
  • Keywords
    audio signal processing; feature extraction; image classification; image filtering; image representation; image texture; matrix algebra; time-frequency analysis; SIF; cochleagram image; gammatone cepstral coefficient; gammatone filter-based time-frequency image; gray-level co-occurrence matrix; image classification; mel-frequency cepstral coefficient; robust audio surveillance; spectrogram image feature; subband time-frequency image texture feature extraction; time-frequency image representation; Cepstral analysis; Feature extraction; Filter banks; Noise; Spectrogram; Time-frequency analysis; Audio surveillance; cochleagram; gammatone filter; gray-level co-occurrence matrix; spectrogram; support vector machines
  • fLanguage
    English
  • Journal_Title
    Information Forensics and Security, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1556-6013
  • Type
    jour
  • DOI
    10.1109/TIFS.2015.2469254
  • Filename
    7206602
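
The abstract describes a gray-level co-occurrence matrix (GLCM) computed in subbands of a time-frequency image. The sketch below illustrates that general idea only and is not the authors' implementation: the function names (glcm, subband_glcms), the choice of offset, the number of gray levels, and the equal-height split of the image rows into subbands are all assumptions made for illustration. The image is assumed to be already quantized to integer gray levels, with rows standing for frequency bins and columns for time frames.

```python
def glcm(image, levels, offset=(0, 1)):
    """Count gray-level pairs (i, j) that co-occur at the given (row, col) offset.

    `image` is a list of rows of integers in range(levels).
    The result is a non-symmetric, unnormalized levels x levels count matrix.
    """
    d_row, d_col = offset
    rows, cols = len(image), len(image[0])
    counts = [[0] * levels for _ in range(levels)]
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + d_row, c + d_col
            # Only count pairs whose second pixel lies inside the image.
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    return counts


def normalize(counts):
    """Turn co-occurrence counts into a joint probability distribution."""
    total = sum(sum(row) for row in counts)
    if total == 0:
        return [[0.0 for _ in row] for row in counts]
    return [[value / total for value in row] for row in counts]


def subband_glcms(image, levels, n_subbands, offset=(0, 1)):
    """Split the rows (assumed frequency axis) into contiguous bands, one GLCM each.

    The equal-height split is an illustrative assumption, not taken from the paper.
    """
    rows = len(image)
    bounds = [k * rows // n_subbands for k in range(n_subbands + 1)]
    return [
        glcm(image[bounds[k]:bounds[k + 1]], levels, offset)
        for k in range(n_subbands)
    ]


if __name__ == "__main__":
    # Toy 4 x 4 "image" with 4 gray levels (hypothetical data).
    toy = [
        [0, 0, 1, 1],
        [0, 0, 1, 1],
        [0, 2, 2, 2],
        [2, 2, 3, 3],
    ]
    print(glcm(toy, levels=4))
    print(subband_glcms(toy, levels=4, n_subbands=2))
```

A feature vector could then be formed, for example, by flattening and concatenating the normalized per-subband matrices; whether the paper does exactly this is not stated in the abstract.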