• DocumentCode
    3467385
  • Title

    Comparing the Contribution of Syntactic and Semantic Features in Closed versus Open Domain Question Answering

  • Author

    Lamjiri, Abolfazl Keighobadi ; Kosseim, Leila ; Radhakrishnan, Thiruvengadam

  • Author_Institution
    Concordia Univ., Montreal
  • fYear
    2007
  • fDate
    17-19 Sept. 2007
  • Firstpage
    679
  • Lastpage
    685
  • Abstract
    In this paper we analyze the contribution of semantic, syntactic and word similarity of document features in closed and open domain question answering. Semantic similarity is computed as the similarity of the action in the candidate sentence to the action asked in the question, measured using WordNet::Similarity on main verbs. The syntactic similarity feature measures the unifiability of a candidate´s parse tree with the question´s parse tree. It uses syntactic restrictions as well as lexical measures to compute the unifiability of critical syntactic participants in the parse trees. Finally, the word similarity of the document containing a candidate sentence is computed as the cosine of the angle between the question keywords vector and the document vector. Since the semantic feature is more reliable on content verbs and syntactic similarity is suitable for questions with a subject- verb-object syntactic structure, we only consider questions with a main content verb in our analysis (non-copulative questions). This type comprise 70% of our closed domain and 33% of our open domain test questions. The combination of these three features achieves an MRR of 28% in our closed domain and 23% in open domain. Our analysis shows that the syntactic feature has a significant contribution in both open and closed domains. However, the path-based lch semantic similarity measure we used, only contributes in our closed domain probably because of less variation in the vocabulary and topic. Document IR score on the other hand, has more contribution in open domain, because query keywords are more discriminating in a large document set with a vast vocabulary range.
  • Keywords
    information retrieval; natural language processing; text analysis; vocabulary; WordNet; closed-open domain question answering; document IR; natural language processing; parse tree; syntactic-semantic similarity feature; vocabulary; Computer science; Customer service; Data mining; Natural languages; Particle measurements; Redundancy; Software engineering; Testing; Vocabulary; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing, 2007. ICSC 2007. International Conference on
  • Conference_Location
    Irvine, CA
  • Print_ISBN
    978-0-7695-2997-4
  • Type

    conf

  • DOI
    10.1109/ICSC.2007.49
  • Filename
    4338410