• DocumentCode
    3413114
  • Title

    Combining robust spike coding with spiking neural networks for sound event classification

  • Author

    Dennis, Jonathan ; Tran Huy Dat ; Haizhou Li

  • Author_Institution
    Inst. for Infocomm Res., A*STAR, Singapore, Singapore
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    176
  • Lastpage
    180
  • Abstract
    This paper proposes a novel biologically inspired method for sound event classification which combines spike coding with a spiking neural network (SNN). Our spike coding extracts keypoints that represent the local maxima components of the sound spectrogram, and are encoded based on their local time-frequency information; hence both location and spectral information are being extracted. We then design a modified tempotron SNN that, unlike the original tempotron, allows the network to learn the temporal distributions of spike coding input, in an analogous way to the generalized Hough transform. The proposed method simultaneously enhances the sparsity of the sound event spectrogram, producing a representation which is robust against noise, as well as maximises the discriminability of the spike coding input in terms of its temporal information, which is important for sound event classification. Experimental results on a large dataset of 50 environment sound events show the superiority of both the spike coding versus the raw spectrogram and the SNN versus conventional cross-entropy neural networks.
  • Keywords
    Hough transforms; encoding; neural nets; signal classification; biologically inspired method; generalized Hough transform; local maxima components; local time-frequency information; location information; modified tempotron SNN; robust spike coding; sound event classification; sound event spectrogram; spectral information; spiking neural networks; temporal distributions; Cost function; Encoding; Feature extraction; Neural networks; Noise; Robustness; Spectrogram; Neural spike coding; local spectrogram features; noise robust; sound event classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177955
  • Filename
    7177955