Title :
Noise Characterization and Classification for Background Estimation
Author :
Maithani, Sunita ; Tyagi, Richa
Author_Institution :
Sci. Anal. Group Defense Res. & Dev. Organ., Delhi
Abstract :
Characterization and classification of noise in the noisy speech is an important part of noisy speech processing in all related fields of application for speech technology. In this paper, problem of noise characterization and categorization of various environmental noises in the noisy speech of different SNR´s (Signal to Noise Ratio) has been handled. This paper purposes a technique for estimation of noise level and classifying the noises in a noisy speech into different categories of noises for background estimation. The technique detects non-speech regions in the noisy speech for SNR estimation and noise classification purpose. Three dimensional representation of noise is obtained using technique, which discriminates visual imprints of different noises in most optimum way in the 2D plane. For classification of test noise, prominent features from the 2D representation of noise are selected and classified using minimum hamming distance classifier using reference standard noise. Two dimensional second order derivatives of FFT magnitude matrix w.r.t. time and frequency are computed from the extracted pause regions. The technique is computationally simple and efficient and gives very high level of accuracy in the noisy speech with segmental SNR range of 20 dB to -20 dB. The test noises are categorized into white, babble, factory, vehicular and channel noises. The efficacy of technique developed is dependent on the limitations of the particular speech and pause detection algorithm used, particularly at moderate value of SNR. At lower value of SNR no such limitations exists and accuracy goes up to even 100%.
Keywords :
fast Fourier transforms; feature extraction; noise (working environment); signal classification; signal detection; signal representation; speech processing; speech recognition; FFT; SNR estimation; babble noise; background estimation; channel noise; environmental noises; factory noise; fast Fourier transform; minimum hamming distance classifier; noise characterization; noise classification; noisy speech processing; nonspeech region detection; reference standard noise; speech-pause detection algorithm; three dimensional noise representation; vehicular noise; white noise; Background noise; Frequency; Hamming distance; Noise level; Production facilities; Signal to noise ratio; Speech enhancement; Speech processing; Testing; Working environment noise; FFT (Fast Fourier Transform); LPC; LSF; MDC (Minimum Distance Classifier); SNR; Speech-Pause Detection;
Conference_Titel :
Signal Processing, Communications and Networking, 2008. ICSCN '08. International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4244-1924-1
Electronic_ISBN :
978-1-4244-1924-1
DOI :
10.1109/ICSCN.2008.4447190