DocumentCode
3688616
Title
Environmental sound classification with convolutional neural networks
Author
Karol J. Piczak
Author_Institution
Institute of Electronic Systems, Warsaw University of Technology
fYear
2015
Firstpage
1
Lastpage
6
Abstract
This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
Keywords
"Neural networks","Training","Accuracy","Convolution","Convolutional codes","Yttrium","Pattern recognition"
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing (MLSP), 2015 IEEE 25th International Workshop on
Type
conf
DOI
10.1109/MLSP.2015.7324337
Filename
7324337
Link To Document