Title :
Automatic Annotation of Non-English Web Content
Author :
Sevcech, Jakub ; Bieliková, M.
Author_Institution :
Inst. of Inf. & Software Eng., Slovak Univ. of Technol. in Bratislava, Bratislava, Slovakia
Abstract :
We face information overload daily, which makes it difficult to find exactly the information we need. While reading, we often encounter a word we do not understand and need an explanation or additional information about it. Annotations on the Web are created and attached to such words for this purpose. In this paper we propose a method for automatically extending content available on the Web by adding annotations to selected terms (keywords) in the text. The method is designed to insert annotations into text written in Slovak, with the potential to be language independent. The annotations themselves are obtained through publicly available information retrieval services. We adapt the created annotations by taking into account implicit feedback from users in the form of click-through data. We evaluate the proposed method in an educational web-based system.
Keywords :
computer aided instruction; information retrieval; natural language processing; text analysis; Slovak text; Web content automatic extension; click-through data; educational Web-based system; implicit feedback; information overload; language independent; non-English Web content automatic annotation; Dictionaries; Encyclopedias; Internet; Shape; Web pages; Web annotation; adaptive annotations; keywords; keywords mapping
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
DOI :
10.1109/WI-IAT.2011.219