DocumentCode :
2063737
Title :
An automatic data mining authority control system: A first approach
Author :
Díaz-Valenzuela, Irene ; Martín-Bautista, María J. ; Vila, M. Amparo
fYear :
2010
fDate :
Nov. 29 2010-Dec. 1 2010
Firstpage :
569
Lastpage :
574
Abstract :
In this paper we present an automatic authority control system for raw noisy web data based on Data Mining. We use a hierarchical clustering approach with a special distance measure combination of three parameters: author name similarity, token similarity and co-authors similarity, each one defined in a specific way. A preliminary experimental study has been performed with real data obtained from CiteSeerX.
Keywords :
bibliographic systems; data mining; pattern clustering; CiteSeerX; Web data; author name similarity; automatic data mining authority control system; co-authors similarity; hierarchical clustering; token similarity; authority control; clustering; data mining; raw web data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-8134-7
Type :
conf
DOI :
10.1109/ISDA.2010.5687205
Filename :
5687205
Link To Document :
بازگشت