DocumentCode
1966617
Title
English-Malayalam Cross-Lingual Information Retrieval — an experience
Author
Nikesh, P.L. ; Idicula, Sumam Mary ; David Peter, S.
Author_Institution
Dept. of Comput. Sci., Cochin Univ. of Sci. & Technol., Kochi
fYear
2008
fDate
18-20 May 2008
Firstpage
271
Lastpage
275
Abstract
This paper describes about an English-Malayalam cross-lingual information retrieval system. The system retrieves Malayalam documents in response to query given in English or Malayalam. Thus monolingual information retrieval is also supported in this system. Malayalam is one of the most prominent regional languages of Indian subcontinent. It is spoken by more than 37 million people and is the native language of Kerala state in India. Since we neither had any full-fledged online bilingual dictionary nor any parallel corpora to build the statistical lexicon, we used a bilingual dictionary developed in house for translation. Other language specific resources like Malayalam stemmer, Malayalam morphological root analyzer etc developed in house were used in this work.
Keywords
dictionaries; linguistics; natural languages; query processing; English-Malayalam cross-lingual system; Malayalam morphological root analyzer; information retrieval system; online bilingual dictionary; query processing; regional language; Architecture; Computer science; Content based retrieval; Dictionaries; Encoding; Government; Information retrieval; Internet; Natural languages; Space technology; Bilingual dictionary; Content based retrieval; Cross-Lingual Information Retrieval; Document ranking; Malayalam; Vector space model;
fLanguage
English
Publisher
ieee
Conference_Titel
Electro/Information Technology, 2008. EIT 2008. IEEE International Conference on
Conference_Location
Ames, IA
Print_ISBN
978-1-4244-2029-2
Electronic_ISBN
978-1-4244-2030-8
Type
conf
DOI
10.1109/EIT.2008.4554312
Filename
4554312
Link To Document