• DocumentCode
    3529286
  • Title

    Factor analysis-based information integration for Arabic dialect identification

  • Author

    Lei, Yun ; Hansen, John H L

  • Author_Institution
    Erik Jonsson Sch. of Eng. & Comput. Sci., Univ. of Texas at Dallas, Richardson, TX
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4337
  • Lastpage
    4340
  • Abstract
    In this study, we propose a new factor analysis-based modeling technique to more clearly describe the composition of the supervector defined by the GMM model for dialect identification. The method utilizes knowledge types of information contained in the transcript file of the data. We evaluate the effects of the proposed modeling algorithm on a GMM-based Arabic dialect identification system. In particular, we compare eigenchannel modeling and our proposed information integration modeling. We show that the proposed modeling can obtain a 4.23% relative EER reduction with the same total number of factors, and a 9.37% relative EER reduction with the same number of channel/session factors versus eigenchannel modeling.
  • Keywords
    Gaussian processes; natural language processing; Arabic dialect identification; GMM model; eigenchannel modeling; factor analysis; information integration modeling; modeling technique; Computer science; Information analysis; Natural languages; Robustness; Speaker recognition; Speech analysis; Speech recognition; Streaming media; Subcontracting; Vocabulary; Arabic; dialect identification; factor analysis; information integration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960589
  • Filename
    4960589