• DocumentCode
    3423257
  • Title

    Language identification using MLKSFM for pre-classification with novel front-end features

  • Author

    Wang, Liang ; Ambikairajah, Eliathamby ; Choi, Eric H C

  • Author_Institution
    Sch. of Electr. Eng. & Telecommun., Univ. of New South Wales, Sydney, NSW
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4245
  • Lastpage
    4248
  • Abstract
    This paper presents two novel contributions to automatic language identification. The first one is the use of the modified multi-layer Kohonen self-organizing feature map (MLKSFM) as a pre-classification for language identification (LID). Secondly, we discuss the novel application of empirical mode decomposition (EMD) to generate features for the LID pre-classification task. The use of instantaneous frequency (IF) and instantaneous amplitude (IA) of a speech signal as features for the pre-classifier is investigated. The experiment results on a 16-language speech database indicates that, the EMD by itself cannot perform well in the LID task, however it helps to improve the pre-classification rate when concatenated with other cepstral features. The overall LID performance is also increased when pre-classification is applied. We achieve LID rates of 85.2% and 62.3% for 45-sec and 10-sec test utterances, respectively.
  • Keywords
    audio databases; self-organising feature maps; speech recognition; 16-language speech database; empirical mode decomposition; instantaneous amplitude; instantaneous frequency; language identification; modified multi-layer Kohonen self-organizing feature map; speech signal; Amplitude estimation; Australia; Frequency estimation; Laboratories; Natural languages; Signal analysis; Spatial databases; Speech; System performance; Vector quantization; Language identification; empirical mode decomposition; modified MLKSFM; modified group delay function; pre-classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518592
  • Filename
    4518592