• DocumentCode
    571619
  • Title

    Measuring Similarity between Sentence Fragments

  • Author

    Guangyuan Huang ; Jianqiang Sheng

  • Author_Institution
    State-Province Joint Lab. of Digital Home Interactive Applic., Sun Yat-sen Univ., Guangzhou, China
  • Volume
    1
  • fYear
    2012
  • fDate
    26-27 Aug. 2012
  • Firstpage
    327
  • Lastpage
    330
  • Abstract
    Sentence fragment has a wide range of applications, such as short text mining, flow diagram search based on label similarity and so on. Existing methods aren´t entirely appropriate for measuring similarity between sentence fragments since they were originally designed for complete sentences or long texts. So we pay more attention to proper nouns which carry important information in sentence fragments. We then propose a novel measuring method applicable for sentence fragments or even short sentences. It calculates the similarity based on the edit distance model instead of traditional vector space model. Besides, manual weight factors are introduced in order to meet the needs of different situations. Our experiments demonstrate that our method outperforms existing methods.
  • Keywords
    data mining; natural language processing; text analysis; edit distance model; flow diagram search; label similarity; manual weight factors; measuring method; sentence fragments; short text mining; vector space model; Accuracy; Artificial intelligence; Humans; Joints; Semantics; Syntactics; Vectors; Sentence fragment; degree of matching; edit distance; measuring similarity; proper nouns;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2012 4th International Conference on
  • Conference_Location
    Nanchang, Jiangxi
  • Print_ISBN
    978-1-4673-1902-7
  • Type

    conf

  • DOI
    10.1109/IHMSC.2012.88
  • Filename
    6305692