Title :
Discrimination of environmental background noise sources using HOS based features of their filter bank decomposed sequences
Author :
Jhanwar, D. ; Sharma, K.K. ; Modani, S.G.
Author_Institution :
Gov. Eng. Coll., Ajmer, India
Abstract :
Discrimination of common environmental background noise sources like train, airport, car, restaurant, street and exhibition mixed with speech signals are required in many applications. These signals are stochastic, non-stationary, non-Gaussian, non-linear and with non-uniform distribution of spectral contents throughout its time length. In this paper, the signal under test is decomposed in different sequences by filtering through a filter bank of ranges 0-500Hz, 500-1000Hz, 1000-1500Hz, 1500-2000Hz, 2000-2500Hz, 2500-3000Hz and above 3000Hz. The feature vector contain the features of only those filtered decomposed sequences corresponding to the particular noise source which can discriminate the other noise sources for the decomposed sequence of same frequency band. The higher order statistics (HOS) based parameters like third-order autocumulant, fourth-order autocumulant, skewness and kurtosis are found to be efficient features for the same. The cumulant based features are modified here as the ratio of their values corresponding to noisy speech decomposed signal to the clean speech (without background noise) decomposed signal for the same frequency range are proved to give better results. It is observed that the extracted feature vectors of some of the decomposed sequences of different noise sources are found more discriminating as compared to without decomposition. Finally the classification of noise sources is done by separating the corresponding feature vectors using Gaussian mixture model (GMM) classifier.
Keywords :
Gaussian processes; channel bank filters; signal classification; signal denoising; speech processing; statistical analysis; GMM classifier; Gaussian mixture model; HOS based feature; cumulant based feature; environmental background noise source; feature vector; filter bank decomposed sequence; fourth-order autocumulant; frequency 0 Hz to 500 Hz; frequency 1000 Hz to 1500 Hz; frequency 1500 Hz to 2000 Hz; frequency 2000 Hz to 2500 Hz; frequency 2500 Hz to 3000 Hz; frequency 500 Hz to 1000 Hz; higher order statistics; kurtosis; noise source classification; nonGaussian signal; nonlinear signal; nonstationary signal; skewness; spectral content nonuniform distribution; speech signal; stochastic signal; third-order autocumulant; time length; Airports; Feature extraction; Filter banks; Noise; Noise measurement; Stochastic processes; Gaussian mixture model; Third-order cumulant; filter bank decomposed sequence; fourth-order cumulant; higher order statistics; kurtosis; skewness;
Conference_Titel :
Computing Communication & Networking Technologies (ICCCNT), 2012 Third International Conference on
Conference_Location :
Coimbatore
DOI :
10.1109/ICCCNT.2012.6395868