DocumentCode
467814
Title
Community Relation Discovery by Named Entities
Author
Zhu, Jian-Han ; Goncalves, Alexandre L. ; Uren, Victoria S. ; Motta, Enrico ; Pacheco, Roberto ; Song, Da-Wei ; Rüger, Stefan
Author_Institution
Open Univ., Milton Keynes
Volume
4
fYear
2007
fDate
19-22 Aug. 2007
Firstpage
1966
Lastpage
1973
Abstract
Discovering who works with whom, on which projects and with which customers is a key task in knowledge management. Although most organizations keep models of organizational structures, these models do not necessarily accurately reflect the reality on the ground. In this paper we present a text mining method called CORDER which first recognizes named entities (NEs) of various types from Web pages, and then discovers relations from a target NE to other NEs which co-occur with it. We evaluated the method on our departmental Website. We used the CORDER method to first find related NEs of four types (organizations, people, projects, and research areas) from Web pages on the Website and then rank them according to their co-occurrence with each of the people in our department. 20 representative people were selected and each of them was presented with ranked lists of each type of NE. Each person specified whether these NEs were related to him/her and changed or confirmed their rankings. Our results indicate that the method can find the NEs with which these people are closely related and provide accurate rankings.
Keywords
data mining; knowledge management; text analysis; Web pages; community relation discovery; knowledge management; organizational structures; text mining method; Cybernetics; Data mining; Knowledge management; Machine learning; Supervised learning; Target recognition; Text mining; Text recognition; Training data; Web pages; Clustering; Named entity recognition; Ranking; Relation discovery; Similarities;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-0973-0
Electronic_ISBN
978-1-4244-0973-0
Type
conf
DOI
10.1109/ICMLC.2007.4370469
Filename
4370469
Link To Document