• DocumentCode
    3485918
  • Title

    Latent semantic analysis for question classification with neural networks

  • Author

    Loni, Babak ; Khoshnevis, Seyedeh Halleh ; Wiggers, Pascal

  • Author_Institution
    Dept. of Media & Knowledge Eng., Delft Univ. of Technol., Delft, Netherlands
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    437
  • Lastpage
    442
  • Abstract
    An important component of question answering systems is question classification. The task of question classification is to predict the entity type of the answer of a natural language question. Question classification is typically done using machine learning techniques. Most approaches use features based on word unigrams which leads to large feature space. In this work we applied Latent Semantic Analysis (LSA) technique to reduce the large feature space of questions to a much smaller and efficient feature space. We used two different classifiers: Back-Propagation Neural Networks (BPNN) and Support Vector Machines (SVM). We found that applying LSA on question classification can not only make the question classification more time efficient, but it also improves the classification accuracy by removing the redundant features. Furthermore, we discovered that when the original feature space is compact and efficient, its reduced space performs better than a large feature space with a rich set of features. In addition, we found that in the reduced feature space, BPNN performs better than SVMs which are widely used in question classification. Our result on the well known UIUC dataset is competitive with the state-of-the-art in this field, even though we used much smaller feature spaces.
  • Keywords
    backpropagation; neural nets; question answering (information retrieval); support vector machines; SVM; back-propagation neural networks; large feature space; latent semantic analysis; machine learning techniques; natural language question; question answering systems; question classification; support vector machines; Accuracy; Feature extraction; Kernel; Neurons; Semantics; Support vector machines; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163971
  • Filename
    6163971