• DocumentCode
    3688616
  • Title

    Environmental sound classification with convolutional neural networks

  • Author

    Karol J. Piczak

  • Author_Institution
    Institute of Electronic Systems, Warsaw University of Technology
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
  • Keywords
    "Neural networks","Training","Accuracy","Convolution","Convolutional codes","Yttrium","Pattern recognition"
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing (MLSP), 2015 IEEE 25th International Workshop on
  • Type

    conf

  • DOI
    10.1109/MLSP.2015.7324337
  • Filename
    7324337