• DocumentCode
    3593906
  • Title

    Language identification through large vocabulary continuous speech recognition

  • Author

    Lim, Boon Pang ; Li, Haizhou ; Chen, Yu

  • Author_Institution
    Speech & Dialogue Process. Lab, Inst. for Infocomm Res., Singapore, Singapore
  • fYear
    2004
  • Firstpage
    49
  • Lastpage
    52
  • Abstract
    In recent years, automatic language identification has become an increasingly important component in practical spoken language systems, and much attention has been devoted to various competing approaches. In this paper, we are concerned with the automatic identification of languages that may be highly similar in nature, such as the various dialects of Chinese. Our approach differs from many recent successful systems by exploiting a fusion of feature scores readily available from a large vocabulary speech recognition system. We show that such features are able to distinguish among the similar sounding dialects of Chinese, and experiments on a nine language corpus show promising performance on a three way identification task.
  • Keywords
    feature extraction; linguistics; speech processing; speech recognition; vocabulary; Chinese dialects; automatic language identification; feature score fusion; large vocabulary continuous speech recognition; nine language corpus; performance; similar sounding dialects; spoken language systems; three way identification task; Automatic speech recognition; Automatic testing; Databases; Decoding; Engines; Natural languages; Speech processing; Speech recognition; System testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2004 International Symposium on
  • Print_ISBN
    0-7803-8678-7
  • Type

    conf

  • DOI
    10.1109/CHINSL.2004.1409583
  • Filename
    1409583