DocumentCode
1783022
Title
Acoustic model optimization for automatic pronunciation quality assessment
Author
Ke Yan
Author_Institution
New Generation of Inf. Technol. Center, China Acad. of Eng. Phys., Mianyang, China
fYear
2014
fDate
28-29 Sept. 2014
Firstpage
1
Lastpage
4
Abstract
Golden acoustic models, which are built or optimized using standard pronunciation data, are widely used in automatic pronunciation quality assessment. However, this work points out that because of the mismatch between training and test, golden acoustic models are unable to accurately measure the pronunciation quality for accented speeches. To deal with the problem, this paper presents a novel approach which uses both standard and non-standard speeches to optimize acoustic model by minimizing the root mean square error between human and machine scores. And we also derive an EBW-like algorithm for parameter optimization. The experimental results proved the effectiveness. The cross correlation increases from 0.610 to 0.713 and the root mean square error reduces from 1.930 to 1.685.
Keywords
acoustic signal processing; least mean squares methods; speech recognition; ASR acoustic modeling; EBW-like algorithm; acoustic model optimization; automatic pronunciation quality assessment; automatic speech recognition; cross correlation; parameter optimization; root mean square error minimization; Acoustic measurements; Acoustics; Data models; Hidden Markov models; Optimization; Quality assessment; Speech; EBW algorithm; acoustic modeling; automatic pronunciation quality assessment; computer assisted language learning; golden acoustic model;
fLanguage
English
Publisher
ieee
Conference_Titel
Multisensor Fusion and Information Integration for Intelligent Systems (MFI), 2014 International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6731-5
Type
conf
DOI
10.1109/MFI.2014.6997649
Filename
6997649
Link To Document