DocumentCode :
2547138
Title :
Parallel Annotation and Population: A Cross-Language Experience
Author :
Sarrafzadeh, Bahareh ; Shamsfard, Mehrnoush
Author_Institution :
Electr. & Comput. Eng. Dept, Shahid Behehsti Univ., Tehran
Volume :
2
fYear :
2009
fDate :
22-24 Jan. 2009
Firstpage :
116
Lastpage :
120
Abstract :
In recent years automatic ontology population (OP) from texts has emerged as a new field of application for knowledge acquisition techniques. In OP, the instances of an ontology classes will be extracted from text and added under the ontology concepts. On the other hand, semantic annotation which is a key task in moving toward semantic Web tries to tag instance data in a text by their corresponding ontology classes; so the ontology population activity accompanies generating semantic annotations usually. In this paper we introduce a cross-lingual population/annotation system called POPTA which annotates Persian texts according to an English lexicalized ontology and populates the English ontology according to the input Persian texts. It exploits a hybrid approach, a combination of statistical and pattern-based methods as well as techniques founded on the Web and search engines and a novel method of resolving translation ambiguities. POPTA also uses Wikipedia as a vast natural language encyclopedia to extract new instances to populate the input ontology.
Keywords :
language translation; linguistics; natural language processing; ontologies (artificial intelligence); statistical analysis; text analysis; English lexicalized ontology; POPTA system; Persian text; Wikipedia; World Wide Web; cross-language; cross-lingual population/annotation system; natural language encyclopedia; ontology population; parallel annotation; pattern-based method; search engine; statistical method; translation ambiguity; Concurrent computing; Data mining; Instruction sets; Knowledge acquisition; Knowledge engineering; Natural languages; Ontologies; Search engines; Semantic Web; Wikipedia; Googling; Wikipedia; cross-language processing; ontology population; semantic annotation; semantic web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering and Technology, 2009. ICCET '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-3334-6
Type :
conf
DOI :
10.1109/ICCET.2009.92
Filename :
4769570
Link To Document :
بازگشت