• Title of article

    Lexical and Syntactic knowledge for Information Retrieval

  • Author/Authors

    Antonio Ferr?ndez، نويسنده ,

  • Issue Information
    دوماهنامه با شماره پیاپی سال 2011
  • Pages
    14
  • From page
    692
  • To page
    705
  • Abstract
    Traditional Information Retrieval (IR) models assume that the index terms of queries and documents are statistically independent of each other, which is intuitively wrong. This paper proposes the incorporation of the lexical and syntactic knowledge generated by a POS-tagger and a syntactic Chunker into traditional IR similarity measures for including this dependency information between terms. Our proposal is based on theories of discourse structure by means of the segmentation of documents and queries into sentences and entities. Therefore, we measure dependencies between entities instead of between terms. Moreover, we handle discourse references for each entity. It has been evaluated on Spanish and English corpora as well as on Question Answering tasks obtaining significant increases.
  • Keywords
    information retrieval , Term proximity , Question answering , Lexical and syntactic relationships , Natural language processing
  • Journal title
    Information Processing and Management
  • Serial Year
    2011
  • Journal title
    Information Processing and Management
  • Record number

    1229149