DocumentCode
3443255
Title
Non-native English speech recognition using bilingual English lexicon and acoustic models
Author
Matsunaga, Shinichiro ; Ogawa, Anna ; Yamaguchi, Y. ; Imamura, Akiyuki
Author_Institution
NTT Cyber Space Labs., Kanagawa, Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper proposes an English speech recognition system that can recognize both non-native (i.e., Japanese) and native English speakers' pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed by considering typical Japanese pronunciation of English. Japanese and English acoustic models are used to recognize both transcriptions, and the recognition result is the highest-likelihood word sequence, which may combine native English-pronounced and Japanese-pronounced words. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance level as that of a purely native English recognition system.
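The lexicon lookup and highest-likelihood selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the lexicon entries, phoneme symbols, function names, and the toy scorer are all hypothetical, and a real system would score each transcription with the matching English or Japanese acoustic models during continuous decoding rather than with a string comparison.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical bilingual lexicon: every word has an English ("en") and a
# Japanese-accented ("ja") phoneme transcription. Symbols are illustrative.
LEXICON: Dict[str, Dict[str, List[str]]] = {
    "right": {"en": ["r", "ay", "t"], "ja": ["r", "a", "i", "t", "o"]},
    "light": {"en": ["l", "ay", "t"], "ja": ["r", "a", "i", "t", "o"]},
}

# A scorer maps (phoneme transcription, model set) to a log-likelihood-like value.
Scorer = Callable[[List[str], str], float]


def make_toy_scorer(observed: List[str]) -> Scorer:
    """Assumed stand-in for acoustic scoring: penalize phoneme mismatches."""
    def score(phones: List[str], model_set: str) -> float:
        # model_set is unused here; a real scorer would select the English
        # or Japanese acoustic models based on it.
        mismatches = sum(1 for p, o in zip(phones, observed) if p != o)
        mismatches += abs(len(phones) - len(observed))
        return float(-mismatches)
    return score


def best_variant(word: str, score: Scorer) -> Tuple[str, float]:
    """Return (model_set, score) for the better-scoring transcription of word."""
    candidates = [(lang, score(phones, lang)) for lang, phones in LEXICON[word].items()]
    return max(candidates, key=lambda c: c[1])


def recognize(hypotheses: List[List[str]], score: Scorer) -> Tuple[List[str], float]:
    """Pick the word sequence with the highest summed best-variant score."""
    scored = [(words, sum(best_variant(w, score)[1] for w in words)) for words in hypotheses]
    return max(scored, key=lambda s: s[1])


if __name__ == "__main__":
    accented = make_toy_scorer(["r", "a", "i", "t", "o"])
    print(best_variant("right", accented))
    native = make_toy_scorer(["l", "ay", "t"])
    print(recognize([["right"], ["light"]], native))
```

Because each word is scored under both transcriptions independently, a single hypothesis can mix English-pronounced and Japanese-pronounced words, which is the combination the abstract refers to.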
Keywords
acoustic signal detection; natural languages; speech recognition; acoustic models; bilingual English lexicon; continuous speech recognition; highest-likelihood word sequence; nonnative English speech recognition; performance level; phoneme transcriptions; pronunciation; Acoustic signal detection; Adaptation model; Databases; Information retrieval; Loudspeakers; Man machine systems; Natural languages; Portals; Speech analysis; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198787
Filename
1198787