Title :
English-Malayalam Cross-Lingual Information Retrieval — an experience
Author :
Nikesh, P.L. ; Idicula, Sumam Mary ; David Peter, S.
Author_Institution :
Dept. of Comput. Sci., Cochin Univ. of Sci. & Technol., Kochi
Abstract :
This paper describes an English-Malayalam cross-lingual information retrieval system. The system retrieves Malayalam documents in response to a query given in English or Malayalam, so monolingual information retrieval is also supported. Malayalam is one of the most prominent regional languages of the Indian subcontinent. It is spoken by more than 37 million people and is the native language of Kerala state in India. Since we had neither a full-fledged online bilingual dictionary nor parallel corpora from which to build a statistical lexicon, we used a bilingual dictionary developed in house for translation. Other language-specific resources developed in house, such as a Malayalam stemmer and a Malayalam morphological root analyzer, were also used in this work.
Keywords :
dictionaries; linguistics; natural languages; query processing; English-Malayalam cross-lingual system; Malayalam morphological root analyzer; information retrieval system; online bilingual dictionary; regional language; Architecture; Computer science; Content based retrieval; Encoding; Government; Information retrieval; Internet; Space technology; Bilingual dictionary; Cross-Lingual Information Retrieval; Document ranking; Malayalam; Vector space model;
Conference_Title :
Electro/Information Technology, 2008. EIT 2008. IEEE International Conference on
Conference_Location :
Ames, IA
Print_ISBN :
978-1-4244-2029-2
Electronic_ISBN :
978-1-4244-2030-8
DOI :
10.1109/EIT.2008.4554312