• DocumentCode
    3443255
  • Title
    Non-native English speech recognition using bilingual English lexicon and acoustic models

  • Author
    Matsunaga, Shinichiro ; Ogawa, Anna ; Yamaguchi, Y. ; Imamura, Akiyuki

  • Author_Institution
    NTT Cyber Space Labs., Kanagawa, Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper proposes an English speech recognition system which can recognize both non-native (i.e., Japanese) and native English speakers' pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed considering typical Japanese pronunciation of English. Japanese and English acoustic models are used in recognizing both transcriptions, and the highest-likelihood word sequence obtained in combining with native English- and Japanese-pronounced words is the recognition result. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance level as that of a purely native English recognition system.
  • Keywords
    acoustic signal detection; natural languages; speech recognition; acoustic models; bilingual English lexicon; continuous speech recognition; highest-likelihood word sequence; nonnative English speech recognition; performance level; phoneme transcriptions; pronunciation; Acoustic signal detection; Adaptation model; Databases; Information retrieval; Loudspeakers; Man machine systems; Natural languages; Portals; Speech analysis; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.2003.1198787
  • Filename
    1198787
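
The abstract describes a lexicon in which every word carries both an English and a Japanese phoneme transcription, with the highest-likelihood hypothesis taken as the result. A minimal sketch of that selection idea for a single isolated word is given below. It is not the authors' implementation: the phone symbols, the toy lexicon entries, and the names `LEXICON`, `score`, and `recognize_word` are all hypothetical, and a negative edit distance stands in for the real acoustic-model log-likelihood.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: each word has an English ("en") and a
# Japanese-accented ("ja") transcription. Phone symbols are illustrative only.
LEXICON: Dict[str, Dict[str, List[str]]] = {
    "bus":   {"en": ["b", "ah", "s"],            "ja": ["b", "a", "s", "u"]},
    "light": {"en": ["l", "ay", "t"],            "ja": ["r", "a", "i", "t", "o"]},
    "table": {"en": ["t", "ey", "b", "ah", "l"], "ja": ["t", "e", "e", "b", "u", "r", "u"]},
}


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (x != y)))  # substitution or match
        prev = cur
    return prev[-1]


def score(observed: List[str], transcription: List[str]) -> float:
    """Stand-in for an acoustic log-likelihood (assumption): higher is better."""
    return -float(edit_distance(observed, transcription))


def recognize_word(observed: List[str]) -> Tuple[str, str, float]:
    """Return (word, variant, score) of the best-scoring lexicon entry,
    searching English and Japanese transcriptions together."""
    candidates = (
        (word, variant, score(observed, phones))
        for word, variants in LEXICON.items()
        for variant, phones in variants.items()
    )
    return max(candidates, key=lambda c: c[2])
```

Calling `recognize_word(["b", "a", "s", "u"])` selects the Japanese-accented entry for "bus", while `recognize_word(["l", "ay", "t"])` selects the English entry for "light". A continuous recognizer, as in the paper, would apply this kind of competition over whole word sequences rather than one word at a time.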