Title :
Data Preprocessing Method Based on User Characteristic of Interests for Web Log Mining
Author :
Ying Han ; Kejian Xia
Author_Institution :
Coll. of Comput. & Commun. Eng., Univ. of Sci. & Technol. in Beijing, Beijing, China
Abstract :
Web log mining is the most important method in Web data mining, and data preprocessing is the primary work. In order to find more value access mode and reduce the data size from the Web, find the data of users and even between users, this paper puts forward a method of Web log data preprocessing based on user characteristic of interests, and then put forward some concepts such as user interest, user interest similarity. Finally, after some experiments, we can show the superiority and recommended value of this new method.
Keywords :
Internet; data mining; user interfaces; Web data mining; Web log data preprocessing method; Web log mining; World Wide Web; user characteristic; user interest similarity; value access mode; Computers; Data mining; Data models; Data preprocessing; Educational institutions; Servers; Vectors; characteristic of interests; data mining; data preprocessing; web log mining;
Conference_Titel :
Instrumentation and Measurement, Computer, Communication and Control (IMCCC), 2014 Fourth International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4799-6574-8
DOI :
10.1109/IMCCC.2014.182