DocumentCode
2830485
Title
Enhanced Query Expansion in English-Arabic CLIR
Author
Bellaachia, Abdelghani ; Amor-Tijani, Ghita
Author_Institution
Dept. of Comput. Sci., George Washington Univ., Washington, DC
fYear
2008
fDate
1-5 Sept. 2008
Firstpage
61
Lastpage
66
Abstract
Arabic is a language with a particularly large vocabulary rich in words with synonymous shades of meaning. Modern Standard Arabic, which is used in formal writings, is the ancient Arabic language incorporated with loanwords derived from foreign languages. Different synonyms and loanwords tend to be used in different writings. Indeed, the Arabic composition style tends to vary throughout the Arab countries (Abdelali, 2004). Relevant documents could be overlooked when the query terms are synonyms or related to the ones used in the document collection. This could deteriorate the performance of a cross lingual information retrieval (CLIR) system. Query expansion (QE) using the document collection is the usual approach taken to enrich translated queries with context related terms. In this study, QE is explored for an English-Arabic CLIR system in which English queries are used to search Arabic documents. A thesaurus-based disambiguation approach is applied to further optimize the effectiveness of that technique. Indeed, experimental results show that QE enhanced by disambiguation gives an improved effectiveness.
Keywords
language translation; natural language processing; query processing; thesauri; Arabic language; English language; cross lingual information retrieval system; document collection; enhanced query expansion; formal writings; thesaurus-based disambiguation; Application software; Computer science; Data mining; Databases; Expert systems; Information retrieval; Performance analysis; Query processing; Vocabulary; Writing; Cross Lingual Information Retrieval; Disambiguation; Query Expansion;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
Conference_Location
Turin
ISSN
1529-4188
Print_ISBN
978-0-7695-3299-8
Type
conf
DOI
10.1109/DEXA.2008.52
Filename
4624692
Link To Document