Title :
Community Relation Discovery by Named Entities
Author :
Zhu, Jian-Han ; Goncalves, Alexandre L. ; Uren, Victoria S. ; Motta, Enrico ; Pacheco, Roberto ; Song, Da-Wei ; Rüger, Stefan
Author_Institution :
Open Univ., Milton Keynes
Abstract :
Discovering who works with whom, on which projects and with which customers is a key task in knowledge management. Although most organizations keep models of organizational structures, these models do not necessarily accurately reflect the reality on the ground. In this paper we present a text mining method called CORDER which first recognizes named entities (NEs) of various types from Web pages, and then discovers relations from a target NE to other NEs which co-occur with it. We evaluated the method on our departmental Website. We used the CORDER method to first find related NEs of four types (organizations, people, projects, and research areas) from Web pages on the Website and then rank them according to their co-occurrence with each of the people in our department. 20 representative people were selected and each of them was presented with ranked lists of each type of NE. Each person specified whether these NEs were related to him/her and changed or confirmed their rankings. Our results indicate that the method can find the NEs with which these people are closely related and provide accurate rankings.
Keywords :
data mining; knowledge management; text analysis; Web pages; community relation discovery; knowledge management; organizational structures; text mining method; Cybernetics; Data mining; Knowledge management; Machine learning; Supervised learning; Target recognition; Text mining; Text recognition; Training data; Web pages; Clustering; Named entity recognition; Ranking; Relation discovery; Similarities;
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370469