Title :
Noun ontology generation from Wikipedia article using Map Reduce with pattern based approach
Author :
Santoso, Joan ; Nugraha, James Nakoda ; Yuniarno, Eko Mulyanto ; Hariadi, Mochamad
Author_Institution :
Dept. of Electr. Eng., Inst. Teknol. Sepuluh November, Surabaya, Indonesia
Abstract :
The volume of data on the internet keeps growing, and this data can serve as supporting information in everyday life. Wikipedia, as an online encyclopedia, provides many resources, data, and information on the internet. The main problem addressed in our research is how to represent information from Indonesian Wikipedia articles in a knowledge representation such as an ontology. An ontology is a set of related concepts together with the relations between those concepts. Ontologies usually have a large and complex structure because they are built to cover a broad topic area. Our approach to ontology building focuses on the hyponymy and meronymy relations. The proposed method uses the taxonomy template information in Wikipedia to extract hyponymy relations and a set of patterns to extract meronymy relations. In our experiments, 5038 hyponymy relations were extracted. The meronymy relation extraction process reached a highest accuracy of 82.23%.
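The pattern-based meronymy extraction described in the abstract could be sketched in a map/reduce style roughly as follows. This is only an illustration under stated assumptions: the abstract does not list the actual patterns, so the two Indonesian patterns here ("X terdiri dari Y", "X consists of Y", and "Y adalah bagian dari X", "Y is part of X") are hypothetical, as are the function names `map_sentence`, `reduce_counts`, and `extract_meronyms`. The sketch runs in a single process rather than on a MapReduce cluster.

```python
import re
from collections import defaultdict

# Hypothetical patterns for illustration only; the pattern set used in the
# paper is not given in the abstract.
PATTERNS = [
    # "X terdiri dari Y" ("X consists of Y"): the whole comes first.
    (re.compile(r"(\w+) terdiri dari (\w+)"), "whole_first"),
    # "Y adalah bagian dari X" ("Y is part of X"): the part comes first.
    (re.compile(r"(\w+) adalah bagian dari (\w+)"), "part_first"),
]


def map_sentence(sentence):
    """Map step: emit ((whole, part), 1) for every pattern match."""
    text = sentence.lower()
    for pattern, order in PATTERNS:
        for first, second in pattern.findall(text):
            if order == "whole_first":
                whole, part = first, second
            else:
                whole, part = second, first
            yield (whole, part), 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted counts per (whole, part) key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def extract_meronyms(sentences):
    """Run the map step over all sentences, then the reduce step."""
    emitted = []
    for sentence in sentences:
        emitted.extend(map_sentence(sentence))
    return reduce_counts(emitted)


if __name__ == "__main__":
    sample = [
        "Mobil terdiri dari mesin",
        "Mesin adalah bagian dari mobil",
        "Rumah terdiri dari atap",
    ]
    for (whole, part), count in extract_meronyms(sample).items():
        print(f"{part} is part of {whole} (seen {count} times)")
```

Counting how often each (whole, part) pair is matched is one simple design choice; the counts could be used to filter out pairs that are matched only once.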
Keywords :
Internet; data handling; encyclopaedias; natural language processing; ontologies (artificial intelligence); parallel processing; Indonesian Wikipedia article; Internet; MapReduce; hyponymy relation extraction; knowledge representation; meronymy relation extraction; noun ontology generation; online encyclopedia; pattern based approach; taxonomy template information; Accuracy; Electronic publishing; Encyclopedias; Internet; Ontologies; Taxonomy; Big Data; Indonesian Language; Map Reduce; Natural Language Processing; Noun Ontology Building;
Conference_Title :
Intelligent Technology and Its Applications (ISITIA), 2015 International Seminar on
Conference_Location :
Surabaya
Print_ISBN :
978-1-4799-7710-9
DOI :
10.1109/ISITIA.2015.7220009