• DocumentCode
    2830485
  • Title

    Enhanced Query Expansion in English-Arabic CLIR

  • Author

    Bellaachia, Abdelghani ; Amor-Tijani, Ghita

  • Author_Institution
    Dept. of Comput. Sci., George Washington Univ., Washington, DC
  • fYear
    2008
  • fDate
    1-5 Sept. 2008
  • Firstpage
    61
  • Lastpage
    66
  • Abstract
    Arabic is a language with a particularly large vocabulary rich in words with synonymous shades of meaning. Modern Standard Arabic, which is used in formal writings, is the ancient Arabic language incorporated with loanwords derived from foreign languages. Different synonyms and loanwords tend to be used in different writings. Indeed, the Arabic composition style tends to vary throughout the Arab countries (Abdelali, 2004). Relevant documents could be overlooked when the query terms are synonyms or related to the ones used in the document collection. This could deteriorate the performance of a cross lingual information retrieval (CLIR) system. Query expansion (QE) using the document collection is the usual approach taken to enrich translated queries with context related terms. In this study, QE is explored for an English-Arabic CLIR system in which English queries are used to search Arabic documents. A thesaurus-based disambiguation approach is applied to further optimize the effectiveness of that technique. Indeed, experimental results show that QE enhanced by disambiguation gives an improved effectiveness.
  • Keywords
    language translation; natural language processing; query processing; thesauri; Arabic language; English language; cross lingual information retrieval system; document collection; enhanced query expansion; formal writings; thesaurus-based disambiguation; Application software; Computer science; Data mining; Databases; Expert systems; Information retrieval; Performance analysis; Query processing; Vocabulary; Writing; Cross Lingual Information Retrieval; Disambiguation; Query Expansion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
  • Conference_Location
    Turin
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3299-8
  • Type

    conf

  • DOI
    10.1109/DEXA.2008.52
  • Filename
    4624692