Title :
Application of Uni-Directional Microphone Array for Identifying English Pronunciation Errors
Author :
Zhang, Bo ; Zhuang, Xin ; Huang, Pan ; Feng, Chen ; Zhao, Jie
Author_Institution :
Coll. of Software, Nankai Univ., Tianjin, China
Abstract :
To identify English pronunciation errors made by Chinese learners, this paper uses uni-directional microphones to construct a superdirective beamformer that captures high-quality input speech, and integrates anti-models and a confidence measure into the speech recognizer to identify the speaker's pronunciation errors accurately. For the beamformer, although superdirective designs based on omni-directional microphones are widely reported, little work details the design with uni-directional microphones. We integrate the transfer function of the uni-directional microphones into the signal model of the beamformer and derive the expression of the superdirective beamformer under the diffuse noise assumption. For the speech recognizer, an anti-model for each phone is trained on the training data excluding the tokens of that phone. With these anti-models integrated into the recognition network, the recognizer can align the user's speech with the prompted text more accurately. A confidence measure is then used to judge whether a segment of an utterance is mispronounced. To evaluate the proposed techniques, simulated noise is injected into utterances provided by Chinese undergraduates at Nankai University, and the recognition results are compared with the judgments of several experts in English linguistics. The experimental results show the effectiveness of the proposed beamforming algorithm and recognition techniques.
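The beamformer idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's signal model, so everything specific here is an assumption: the microphones are modeled as ideal first-order cardioids (`alpha = 0.5`), the array is a four-element endfire line with 5 cm spacing, the diffuse-noise coherence matrix is obtained by numerical integration over the sphere, and a small diagonal loading term is added for robustness. The weights follow the standard superdirective (MVDR under diffuse noise) form w = Γ⁻¹d / (dᴴΓ⁻¹d), with the microphone directivity folded into both Γ and d. All function names are hypothetical.

```python
import numpy as np

C = 343.0  # speed of sound in air, m/s


def sphere_grid(n_theta=60, n_phi=120):
    """Midpoint-rule directions on the unit sphere; weights sum to 1."""
    theta = (np.arange(n_theta) + 0.5) * np.pi / n_theta
    phi = (np.arange(n_phi) + 0.5) * 2.0 * np.pi / n_phi
    t, p = np.meshgrid(theta, phi, indexing="ij")
    dirs = np.stack(
        [np.sin(t) * np.cos(p), np.sin(t) * np.sin(p), np.cos(t)], axis=-1
    ).reshape(-1, 3)
    weights = np.sin(t).ravel()  # surface element is proportional to sin(theta)
    return dirs, weights / weights.sum()


def array_response(dirs, positions, orientations, freq, alpha=0.5):
    """Response of each microphone to unit plane waves arriving from `dirs`.

    Assumed microphone model: first-order pattern alpha + (1 - alpha) * cos(angle),
    where the angle is measured from the microphone axis (alpha = 0.5 is a cardioid).
    Returns an array of shape (n_dirs, n_mics).
    """
    k = 2.0 * np.pi * freq / C
    phase = np.exp(1j * k * (dirs @ positions.T))
    pattern = alpha + (1.0 - alpha) * (dirs @ orientations.T)
    return pattern * phase


def superdirective_weights(positions, orientations, look_dir, freq,
                           alpha=0.5, loading=1e-3):
    """Superdirective weights for directional microphones in diffuse noise."""
    dirs, q = sphere_grid()
    a = array_response(dirs, positions, orientations, freq, alpha)
    # Diffuse-noise covariance: Gamma_ij = average over the sphere of a_i * conj(a_j).
    gamma = a.T @ (q[:, None] * a.conj())
    d = array_response(look_dir[None, :], positions, orientations, freq, alpha)[0]
    m = len(positions)
    # Diagonal loading (an added assumption) limits white-noise amplification.
    loaded = gamma + loading * (np.trace(gamma).real / m) * np.eye(m)
    num = np.linalg.solve(loaded, d)
    w = num / np.vdot(d, num)  # enforces the distortionless constraint w^H d = 1
    return w, gamma, d


def directivity_factor(w, gamma, d):
    """Look-direction power gain divided by diffuse-noise output power."""
    signal = abs(np.vdot(w, d)) ** 2
    noise = np.vdot(w, gamma @ w).real
    return signal / noise


# Hypothetical geometry: four cardioids on the x-axis, all facing +x (endfire).
positions = np.array([[i * 0.05, 0.0, 0.0] for i in range(4)])
orientations = np.tile([1.0, 0.0, 0.0], (4, 1))
look = np.array([1.0, 0.0, 0.0])

w, gamma, d = superdirective_weights(positions, orientations, look, freq=1000.0)
di_db = 10.0 * np.log10(directivity_factor(w, gamma, d))
```

In this sketch `di_db` is the directivity index of the array at 1 kHz. Setting `alpha=1.0` reduces the microphones to omni-directional ones, in which case the numerically integrated `gamma` approximates the familiar sinc-shaped diffuse coherence matrix.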
Keywords :
array signal processing; microphone arrays; speech recognition; speech recognition equipment; English pronunciation errors; high quality input speech; speech recognizer; superdirective beamformer; uni-directional microphone array; Application software; Array signal processing; Educational institutions; Feedback; Microphone arrays; Software quality; Speech recognition; Text recognition; Training data; Transfer functions;
Conference_Title :
Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4244-4129-7
Electronic_ISBN :
978-1-4244-4131-0
DOI :
10.1109/CISP.2009.5303819