Title :
An automatic data mining authority control system: A first approach
Author :
Díaz-Valenzuela, Irene ; Martín-Bautista, María J. ; Vila, M. Amparo
fDate :
Nov. 29 2010-Dec. 1 2010
Abstract :
In this paper we present an automatic authority control system for raw noisy web data based on Data Mining. We use a hierarchical clustering approach with a special distance measure combination of three parameters: author name similarity, token similarity and co-authors similarity, each one defined in a specific way. A preliminary experimental study has been performed with real data obtained from CiteSeerX.
Keywords :
bibliographic systems; data mining; pattern clustering; CiteSeerX; Web data; author name similarity; automatic data mining authority control system; co-authors similarity; hierarchical clustering; token similarity; authority control; clustering; data mining; raw web data;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-8134-7
DOI :
10.1109/ISDA.2010.5687205