Title :
Cochleagram image feature for improved robustness in sound recognition
Author :
Sharan, Roneel V. ; Moir, Tom J.
Author_Institution :
School of Engineering, Auckland University of Technology, Private Bag 92006, 1142, New Zealand
Abstract :
In this paper, we use the cochleagram image of sound signals for time-frequency analysis and feature extraction, instead of the conventional spectrogram image, in an audio surveillance application. The signal is firstly passed through a gammatone filter which models the auditory filters in the human cochlea. The filtered signal is then divided into small windows and the energy in each window is added and normalized which gives the intensity values of the cochleagram image. We then divide the cochleagram image into blocks and extract central moments as features. Using two feature vector representation methods, the results show significant improvement in overall classification accuracy when compared to results from literature employing similar feature extraction and representation techniques but using spectrogram images. The most improved results were at low signal-to-noise ratios.
Keywords :
Image recognition; Image resolution; Noise; Speech; Speech recognition; audio surveillance; central moments; cochleagram; sound recognition; support vector machine;
Conference_Titel :
Digital Signal Processing (DSP), 2015 IEEE International Conference on
Conference_Location :
Singapore, Singapore
DOI :
10.1109/ICDSP.2015.7251910