DocumentCode
2174094
Title
An investigation of subspace modeling for phonetic and speaker variability in automatic speech recognition
Author
Rose, Richard ; Yin, Shou-Chun ; Tang, Yun
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear
2011
fDate
22-27 May 2011
Firstpage
4508
Lastpage
4511
Abstract
This paper investigates the impact of sub space based techniques for acoustic modeling in automatic speech recognition (ASR). There are many well known approaches to subspace based speaker adaptation which represent sources of variability as a projection within a low dimensional subspace. A new approach to acoustic modeling in ASR, referred to as the subspace based Gaussian mixture model (SGMM), represents phonetic variability as a set of projections applied at the state level in a hidden Markov model (HMM) based acoustic model. The impact of the SGMM in modeling these intrinsic sources of variability is evaluated for a continuous speech recognition (CSR) task. The SGMM is shown to provide an 18% reduction in word error rate (WER) for speaker independent (SI) ASR relative to the continuous density HMM (CDHMM) in the resource management CSR domain. The SI performance obtained from SGMM also represents a 5% reduction in WER relative to subspace based speaker adaption in an unsupervised speaker adaptation scenario.
Keywords
hidden Markov models; speech recognition; ASR; CDHMM; SGMM; SI ASR; WER; acoustic model; automatic speech recognition; continuous density HMM; continuous speech recognition task; hidden Markov model; phonetic variability; resource management CSR domain; speaker adaptation; speaker independent ASR; speaker variability; subspace based Gaussian mixture model; subspace modeling; word error rate; Acoustics; Adaptation models; Equations; Hidden Markov models; Mathematical model; Speech recognition; Training; automated speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947356
Filename
5947356
Link To Document