• DocumentCode
    2361769
  • Title

    Automatic language identification and discrimination using the modified group delay feature

  • Author

    Hegde, Rajesh M. ; Murthy, Hema A.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Chennai, India
  • fYear
    2005
  • fDate
    4-7 Jan. 2005
  • Firstpage
    395
  • Lastpage
    399
  • Abstract
    Automatic language identification (LID) systems typically use features derived from the Fourier transform magnitude, such as MFCC and its derivatives, and PLP cepstra. Although this discards half of the underlying spectral information, attempts to utilize the phase spectrum for deriving features have been minimal. This paper investigates the use of features derived from the Fourier transform phase for implementing LID systems. Features derived from the modified group delay function, which we call the modified group delay feature (MODGDF), are used in this study. The performance of the MODGDF and the traditional MFCC in a GMM-based LID system is discussed for a 3-language and an 11-language task. Results of language discriminability analysis are also presented. The MODGDF is found to outperform MFCC in terms of both identification performance and discriminability of languages.
  • Keywords
    Fourier transforms; cepstral analysis; speaker recognition; speech processing; Fourier transform; GMM based LID system; automatic language discrimination; automatic language identification; modified group delay feature; Computer science; Data mining; Delay; Feature extraction; Mel frequency cepstral coefficient; Natural languages; Speech recognition; Surges; Vectors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
  • Print_ISBN
    0-7803-8840-2
  • Type

    conf

  • DOI
    10.1109/ICISIP.2005.1529484
  • Filename
    1529484