• DocumentCode
    3109525
  • Title
    Semantic Chunk Annotation for questions using Maximum Entropy
  • Author
    Fan, Shixi ; Zhang, Yaoyun ; Ng, Wing W Y ; Wang, Xuan ; Wang, Xiaolong
  • Author_Institution
    Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen
  • fYear
    2008
  • fDate
    12-15 Oct. 2008
  • Firstpage
    450
  • Lastpage
    454
  • Abstract
    We present a Maximum Entropy (ME) model for Semantic Chunk Annotation in a Chinese Question and Answer (Q&A) system. The model was derived from a corpus of real-world questions collected from Internet discussion groups. Because the questions are meant to be answered by other people, they are very complex. Semantic chunks are introduced, the features of the model are described, and mutual information (MI) is adopted for feature selection. The training data consist of 14000 sentences and the test data consist of 4000 sentences. The resulting F-score is 90.68%.
  • Keywords
    maximum entropy methods; query processing; search engines; semantic Web; Chinese Question and Answer system; Internet; Semantic Chunk Annotation; feature selection; maximum entropy model; mutual information; Computer architecture; Computer science; Databases; Entropy; Information retrieval; Natural languages; Testing; Maximum Entropy; Q&A
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
  • Conference_Location
    Singapore
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2383-5
  • Electronic_ISBN
    1062-922X
  • Type
    conf
  • DOI
    10.1109/ICSMC.2008.4811317
  • Filename
    4811317