Title :
Speech enhancement using non negative matrix factorization and enhanced NMF
Author :
Akarsh, K.A. ; Selvi, R. Senthamizh
Author_Institution :
Dept. of Electron. & Commun. Eng., Easwari Eng. Coll., Chennai, India
Abstract :
Speech enhancement is a dominant research area, which is used to augment the degraded speech. This paper is based on speech enhancement using Bayesian Formulation of Nonnegative Matrix Factorization algorithm with Hidden Markov Model (BNMF HMM) for supervised speech enhancement. This paper, explore a new class of unsupervised speech de-noising algorithm known as Enhanced NMF (ENMF), which is used to develop a speech enhancement system neither the speaker identity nor the type of noise is known in advance. NMF methods are used for source separation. Extensive experiments are carried out to examine the performance of the proposed method under different circumstances. Furthermore, this work contrasts the recital of the developed algorithms with state of the art speech enhancement schemes using various objective and subjective measures. The experimental results show that Enhanced NMF (ENMF) method is more efficient than other conservative methods by using Signal to Noise Ratio (SNR) in dB, Signal to Distortion Ratio (SDR) in dB and Perceptual Evaluation of Speech Quality (PESQ) in Mean Opinion Score.
Keywords :
hidden Markov models; matrix algebra; speech enhancement; BNMF HMM; Bayesian formulation of nonnegative matrix factorization algorithm with Hidden Markov Model; NMF; PESQ; SDR; SNR; degraded speech; mean opinion score; non negative matrix factorization; perceptual evaluation of speech quality; signal to distortion ratio; signal to noise ratio; source separation; speaker identity; speech denoising algorithm; speech enhancement system; supervised speech enhancement; Hidden Markov models; Noise measurement; Signal to noise ratio; Spectrogram; Speech; Speech enhancement; Bayesian Formulation of Nonnegative Matrix Factorization (BNMF); Enhanced NMF (ENMF); Hidden Markov model (HMM); Mean Opinion Score (MOS); Perceptual Evaluation of Speech Quality (PESQ); Signal to Distortion Ratio (SDR);
Conference_Titel :
Circuit, Power and Computing Technologies (ICCPCT), 2015 International Conference on
Conference_Location :
Nagercoil
DOI :
10.1109/ICCPCT.2015.7159386