Title :
Hierarchical clustering and robust identification for block-based autoregressive speech parameter estimation
Author :
Ruofei Chen ; Cheung-Fat Chan
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, China
Abstract :
Given accurate system parameters like state transition matrix F and corruption mapping matrix H, clean speech autoregressive (AR) parameters can be effectively estimated from a series of noisy observations with Kalman filtering. In this paper, we address several fundamental issues to improve the linear dynamical system (LDS) based AR parameter estimation. A hierarchical time series clustering scheme is devised to truly group speech blocks with similar trajectories and corruption types. In addition, a correlated robust identification scheme using a posteriori signal-to-noise (SNR) mask is proposed to improve the identification accuracy. The effectiveness of the proposed clustering and identification scheme is evaluated in terms of spectral distortion between the Kalman estimates and the true clean speech parameters. Significant improvement is observed over the original matrix quantization (MQ) based approach. The proposed scheme is also successfully applied in a model-based speech enhancement application, and is expected to be effective in various codebook driven speech applications for robust identification purpose.
Keywords :
Kalman filters; autoregressive processes; matrix algebra; pattern clustering; speech enhancement; time series; Kalman filtering; LDS; MQ; SNR; a posteriori signal-to-noise mask; block-based autoregressive speech parameter estimation; clean speech autoregressive parameters; codebook driven speech applications; corruption mapping matrix; hierarchical time series clustering scheme; linear dynamical system based AR parameter estimation; matrix quantization based approach; model-based speech enhancement application; robust identification; state transition matrix; Estimation; Noise measurement; Signal to noise ratio; Speech; Trajectory; Vectors; autoregressive; clustering; identification; linear dynamical system; time series;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423482