• DocumentCode
    1550406
  • Title

    Language Identification: A Tutorial

  • Author

    Ambikairajah, Eliathamby ; Li, Haizhou ; Wang, Liang ; Yin, Bo ; Sethu, Vidhyasaharan

  • Author_Institution
    Univ. of New South Wales, Sydney, NSW, Australia
  • Volume
    11
  • Issue
    2
  • fYear
    2011
  • Firstpage
    82
  • Lastpage
    108
  • Abstract
    This tutorial presents an overview of the progression of spoken language identification (LID) systems and current developments. The introduction provides a background on automatic language identification systems using syntactic, morphological, and in particular, acoustic, phonetic, phonotactic and prosodic level information. Different frontend features that are used in LID systems are presented. Several normalization and language modelling techniques have also been presented. We also discuss different LID system architectures that embrace a variety of front-ends and back-ends, and configurations such as hierarchical and fusion classifiers. Evaluations of the LID system are presented using NIST language recognition evaluation tasks.
  • Keywords
    natural language processing; speech recognition; LID system architectures; NIST language recognition; automatic language identification systems; language modelling techniques; spoken language identification; Automatic speech recognition; Identifications; Mel frequency cepstral coefficient; Speech recognition; Tutorials;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1531-636X
  • Type

    jour

  • DOI
    10.1109/MCAS.2011.941081
  • Filename
    5871469