  • DocumentCode
    2948971
  • Title
    Evaluation and classification of syntax information usage in determining short text semantic similarity
  • Author
    Batanovic, Vuk ; Bojic, Dejan
  • Author_Institution
    Elektroteh. Fak., Univ. u Beogradu, Belgrade, Serbia
  • fYear
    2013
  • fDate
    26-28 Nov. 2013
  • Firstpage
    821
  • Lastpage
    824
  • Abstract
    This paper outlines and categorizes ways of using syntax information in a number of algorithms for determining short text semantic similarity. Algorithm performance was evaluated using the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. Among the described algorithms and approaches to using syntax information, we identify those best suited for application in languages with limited electronic linguistic tools and, with that goal in mind, propose a new algorithm classification.
  • Keywords
    computational linguistics; natural language processing; pattern classification; text analysis; Microsoft Research Paraphrase Corpus; algorithm performance evaluation; electronic linguistic tools; paraphrase detection test; short-text semantic similarity determination; syntax information usage classification; syntax information usage evaluation; Coal; Computational linguistics; Electronic mail; Knowledge discovery; Labeling; Semantics; Syntactics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Telecommunications Forum (TELFOR), 2013 21st
  • Conference_Location
    Belgrade
  • Print_ISBN
    978-1-4799-1419-7
  • Type
    conf
  • DOI
    10.1109/TELFOR.2013.6716356
  • Filename
    6716356