• DocumentCode
    177435
  • Title

    Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule

  • Author

    Morchid, Mohamed ; Dufour, Richard ; Bousquet, Pierre-Michel ; Bouallegue, Mohamed ; Linares, Georges ; De Mori, Renato

  • Author_Institution
    LIA, Univ. of Avignon, Avignon, France
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    126
  • Lastpage
    130
  • Abstract
    In this paper, we study the impact of dialogue representations and classification methods in the task of theme identification of telephone conversation services having highly imperfect automatic transcriptions. Two dialogue representations are firstly compared: the classical Term Frequency-Inverse Document Frequency with Gini purity criteria (TF-IDF-Gini) method and the Latent Dirichlet Allocation (LDA) approach. We then propose to study an original classification method that takes advantage of the LDA topic space representation, highlighted as the best dialogue representation. To do so, two assumptions about topic representation led us to choose a Gaussian process (GP) based method. This approach is compared with a Support Vector Machine (SVM) classification method. Results show that the GP approach is a better solution to deal with the multiple theme complexity of a dialogue, no matter the conditions studied (manual or automatic transcriptions). We finally discuss the impact of the topic space reduction on the classification accuracy.
  • Keywords
    Gaussian processes; interactive systems; pattern classification; speech recognition; support vector machines; Gaussian classifier; Gaussian process based method; LDA approach; SVM classification method; TF-IDF-Gini method; automatic transcriptions; decision rule; dialogue classification; dialogue representation; latent dirichlet allocation; multiple theme complexity; support vector machine; telephone conversation services; term frequency-inverse document frequency-Gini purity criteria; theme identification; topic space representation; Accuracy; Manuals; Resource management; Semantics; Speech; Support vector machines; Vectors; Gaussian process; Latent dirichlet allocation; SVM; Speech analytics; Theme classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853571
  • Filename
    6853571