• DocumentCode
    3444090
  • Title

    Improved Sentence Similarity Algorithm based on VSM and its application in Question Answering System

  • Author

    Liang, Xu ; Wang, Dongjiao ; Huang, Ming

  • Author_Institution
    Software Technol. Inst., Dalian Jiaotong Univ., Dalian, China
  • Volume
    1
  • fYear
    2010
  • fDate
    29-31 Oct. 2010
  • Firstpage
    368
  • Lastpage
    371
  • Abstract
    In the FAQ-based Chinese Question Answering System, the most critical issue is how to calculate the similarity between the user questions and the questions in the FAQ. The traditional VSM-based Sentence Similarity Algorithm usually regards word as the basic linguistic unit of sentences and mainly considers the statistical information of words in questions, but doesn´t take the word importance in the professional field and the semantic information of words into account. For these reasons, this paper proposes an Improved Sentence Similarity Algorithm Based on VSM, regarding notion as the basic linguistic unit of sentences, through conceptually abstracting and professionally classifying to improve the performance of Sentence Similarity Algorithm. Testing in Chinese FAQ system of specific areas, experimental result shows that the performance of the improved algorithm is superior to the traditional VSM-based sentence similarity algorithm evidently.
  • Keywords
    classification; query processing; text analysis; Chinese FAQ system; Chinese question answering system; VSM; classification; linguistic unit; professional field; semantic information; sentence similarity algorithm; statistical information; words; Accuracy; Semantics; Chinese question answering system; FAQ; VSM model; sentence similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems (ICIS), 2010 IEEE International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4244-6582-8
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2010.5658525
  • Filename
    5658525