• DocumentCode
    3861974
  • Title

    Spoken language recognition-a step toward multilinguality in speech processing

  • Author

    J. Navratil

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    9
  • Issue
    6
  • fYear
    2001
  • Firstpage
    678
  • Lastpage
    685
  • Abstract
    Automatic recognition of spoken languages has become an important feature in a variety of speech-enabled multilingual applications which, besides accuracy, also demand for efficient and "linguistically scalable" algorithms. This paper deals with a particularly successful approach based on phonotactic-acoustic features and presents systems for language identification as well as for unknown-language rejection. An architecture with multipath decoding, improved phonotactic models using binary-tree structures, and acoustic pronunciation models serve as a framework for experiments and discussion on these two tasks. In particular, language identification accuracy on a telephone-speech task (NIST´95 evaluation) in six and nine languages is presented together with results from a perceptual experiment carried out with human listeners. The performance of language rejection based on phonotactic modeling combined with a monolingual LVCSR system in the domain of broadcast news transcription is also reported. Besides yielding state-of-the-art performance, the described systems are computationally inexpensive and easily extensible (scalable) to new languages without the need for linguistic experts.
  • Keywords
    "Natural languages","Speech recognition","Speech processing","Automatic speech recognition","Humans","Broadcasting","Databases","Decoding","Information retrieval","Face"
  • Journal_Title
    IEEE Transactions on Speech and Audio Processing
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.943345
  • Filename
    943345