Non-native English speech recognition using bilingual English lexicon and acoustic models

Author

Matsunaga, Shinichiro ; Ogawa, Anna ; Yamaguchi, Y. ; Imamura, Akiyuki

Author_Institution

NTT Cyber Space Labs., Kanagawa, Japan

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

This paper proposes an English speech recognition system that can recognize English speech pronounced by both non-native (i.e. Japanese) and native English speakers. The system uses a bilingual pronunciation lexicon in which each word has both an English and a Japanese phoneme transcription. The Japanese transcription is constructed by considering typical Japanese pronunciation of English. Japanese and English acoustic models are used to recognize both transcriptions, and the recognition result is the highest-likelihood word sequence obtained by combining native-English-pronounced and Japanese-pronounced words. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance level as a purely native English recognition system.
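
The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The lexicon entries, the phoneme symbols, the pre-segmented input, and the scoring function (negative edit distance, standing in for acoustic-model log-likelihoods) are all assumptions made for illustration. A real recognizer would score audio frames with HMM acoustic models during continuous decoding.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical bilingual lexicon: every word carries an English-phoneme and a
# Japanese-phoneme transcription. Entries are illustrative, not from the paper.
LEXICON: Dict[str, Dict[str, List[str]]] = {
    "hot": {"en": ["hh", "aa", "t"], "ja": ["h", "o", "tt", "o"]},
    "milk": {"en": ["m", "ih", "l", "k"], "ja": ["m", "i", "r", "u", "k", "u"]},
}


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        cur = [i]
        for j, y in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def toy_log_likelihood(observed: List[str], transcription: List[str]) -> float:
    """Assumed stand-in for an acoustic score: higher means a closer match."""
    return -float(edit_distance(observed, transcription))


def best_variant(word: str, observed: List[str]) -> Tuple[str, float]:
    """Pick the English or Japanese transcription that scores highest."""
    return max(
        ((tag, toy_log_likelihood(observed, phones))
         for tag, phones in LEXICON[word].items()),
        key=lambda pair: pair[1],
    )


def decode(
    hypotheses: List[List[str]], segments: List[List[str]]
) -> Optional[Tuple[List[str], List[str], float]]:
    """Return the highest-scoring word sequence; each word may use either variant."""
    best: Optional[Tuple[List[str], List[str], float]] = None
    for words in hypotheses:
        if len(words) != len(segments):
            continue  # this toy decoder assumes one segment per word
        picks = [best_variant(w, seg) for w, seg in zip(words, segments)]
        total = sum(score for _, score in picks)
        if best is None or total > best[2]:
            best = (words, [tag for tag, _ in picks], total)
    return best


# One Japanese-pronounced word followed by one native-pronounced word.
segments = [["h", "o", "tt", "o"], ["m", "ih", "l", "k"]]
result = decode([["hot", "hot"], ["hot", "milk"]], segments)
print(result)
```

Because each word is scored against both transcriptions independently, a single utterance can mix Japanese-pronounced and native-pronounced words, which is the combination the abstract describes.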

Keywords

acoustic signal detection; natural languages; speech recognition; acoustic models; bilingual English lexicon; continuous speech recognition; highest-likelihood word sequence; nonnative English speech recognition; performance level; phoneme transcriptions; pronunciation; Acoustic signal detection; Adaptation model; Databases; Information retrieval; Loudspeakers; Man machine systems; Natural languages; Portals; Speech analysis; Speech recognition;

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198787

Filename

1198787