Title :
Entity Resolution with Attribute and Connection Graph
Author :
Niu, Lingfeng ; Jianmin Wu ; Shi, Yong
Author_Institution :
Res. Center on Fictitious Econ. & Data Sci., Beijing, China
Abstract :
Entity resolution is a problem that arises in many information integration scenarios, which is to resolve the underlying entity that occurs in the data with the same or different surface forms. Besides the attributes information for each entity, there are also link or connection graph among the entities in most of the real world data. In this work, we propose an unsupervised entity resolution algorithm that utilizes both the attributes and connection graph of the entity. The algorithm propagates the similarity of each entity pair based on the connection graph in the similar way as in Page Rank. Instead of using even distribution for random access component in Page Rank, the attributes similarity of each entity pair is used to express that more similar entity pair in attributes should have larger Page Rank score. Experimental results on user profile matching on different social networking websites demonstrate the effectiveness of our new solution.
Keywords :
data integration; graph theory; social networking (online); PageRank; attribute graph; connection graph; entity resolution; information integration scenarios; random access component; social networking websites; unsupervised entity resolution algorithm; Conferences; Data mining; Educational institutions; Electronic mail; Social network services; USA Councils; Entity Resolution; PageRank; Social Networks; User matching;
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.75