DocumentCode :
3696918
Title :
Entity Recognition and Relations Extraction Based on the Structure of Online Encyclopedia
Author :
Qing Song;Yue Yang
Author_Institution :
New Media Inst., Commun. Univ. of China, Beijing, China
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
478
Lastpage :
482
Abstract :
In order to construct the knowledge base in the field of journalism, this paper improved the knowledge representation framework of Freebase to make it more suitable for the knowledge in journalism domain. On the basis, we chose to extract entities, entity attributes and relationships between entities from Baidu Encyclopedia websites in order to analyze its structure. According to the infobox, the Character Relationship, the Relevant Characters and the Category Labels templates on the Baidu Encyclopedia webpages, we harvested entity triples (Entity1, Relation, Entity2). Then, we supplemented the characters relationship types through the Entity Relation template in Hudong Encyclopedia webpages. Through the natural language processing technology, the entity similarity algorithm, the association rules reasoning algorithm and other methods, we cleaned and supplemented the knowledge. After that, we stored the knowledge to the graph database according to the knowledge representation model. Finally, we got a better knowledge base in the field of journalism.
Keywords :
"Encyclopedias","Knowledge based systems","Uniform resource locators","Knowledge representation","Text recognition","Data mining"
Publisher :
ieee
Conference_Titel :
Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI), 2015 3rd International Conference on
Type :
conf
DOI :
10.1109/ACIT-CSI.2015.91
Filename :
7336111
Link To Document :
بازگشت