Title :
Specific features of a converter of web documents from Bengali to Universal Networking Language
Author :
Ali, Mohamed ; Das, Jugal Krishna ; Al-Mamun, S. M Abdullah ; Choudhury, Mihir
Author_Institution :
Dept. of CSE, East West Univ., Dhaka
Abstract :
In this paper, we present a workable structure along with characteristic features of a subsystem that may become an integral part of a language server bridging Bengali and the Universal Networking Language (UNL). We try to assimilate the results of the research efforts of the UNL community and also of various machine translation projects. Vast information resources in different languages are available in the Internet, but the can not be shared (because of vastly due to the language barrier). And the UNL community is set to devise an effective and efficient system to diminish that barrier with an ultimate aim to allow automatic conversion of Web based resources in one member language to that in another member language. A good number of researchers in computational linguistics all over the world have already joined hands with the UNL initiators, and research groups representing most widely used natural languages are working intensively for the purpose. This paper is to demonstrate our pioneering efforts in the field of Bengali (Bangla). Here we here outline a possible Bangla-UNL dictionary, feature an annotation editor for Bangla texts, infer significant morphological, syntactic and semantic rules for parsing Bangla web documents in connection with conversion to the UNL, and show possible ways of future contribution towards the goal.
Keywords :
Internet; computational linguistics; document handling; language translation; natural language processing; Bangla Web documents; Internet; Universal Networking Language; Web based resources; automatic conversion; computational linguistics; language server; machine translation projects; natural languages; parsing; workable structure; Computational linguistics; Computer networks; Costs; Dictionaries; Information resources; Internet; Natural languages; Network servers; Scattering; Web server; Bangla-UNL Dictionary; Deconverter; Enconverter; Hyper graph; Morphological Analysis; Universal Networking Language (UNL); Universal Words (UW);
Conference_Titel :
Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-1691-2
Electronic_ISBN :
978-1-4244-1692-9
DOI :
10.1109/ICCCE.2008.4580700