• DocumentCode
    591100
  • Title

    Paraphrase extraction from interactive Q&A communities

  • Author

    Hu Hongsi ; Zhang Wenbo ; Yao Tianfang

  • Author_Institution
    UDS-SJTU Joint Res. Lab. for Language Technol., Shanghai Jiaotong Univ., Shanghai, China
  • fYear
    2012
  • fDate
    27-29 Aug. 2012
  • Firstpage
    268
  • Lastpage
    272
  • Abstract
    Paraphrase is widely researched in last decade. Most of the researches are focused on acquisition of paraphrase from various language resources and generation of paraphrase. It is a hot topic that how to build large scale of paraphrase corpus, and it is the first step for paraphrase exploration as well. Interactive question answering communities which are a kind of special Q&A platform skipping over natural language understood by computer but just providing a platform for communication among people, have corpus with quick growing rate and sentences in diversified expressions. These advantages provide great value for paraphrase research and extend paraphrase corpus in huge scale. We propose a method on how to extract paraphrase from interactive Q&A community in this paper. The experiment results show the precision, recall and f-measure can reach to 0.7725, 0.7349 and 0.7532 respectively, and paraphrase could be extracted effectively.
  • Keywords
    natural language processing; pattern classification; question answering (information retrieval); support vector machines; f-measure; interactive Q-and-A community; language resource; paraphrase acquisition; paraphrase corpus; paraphrase exploration; paraphrase extraction; paraphrase generation; precision measure; question-and-answer community; recall measure; Abstracts; Communities; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Networking Technology (ICCNT), 2012 8th International Conference on
  • Conference_Location
    Gueongju
  • Print_ISBN
    978-1-4673-1326-1
  • Type

    conf

  • Filename
    6418665