Title :
A session identification algorithm based on frame page and pagethreshold
Author :
Yuankang Fang ; Huang Zhiqiu
Author_Institution :
Inf. Sci. & Technol. Sch., Nanjing Univ. of Aeronaut. & Astronaut., Nanjing, China
Abstract :
Session identification is an important step in data processing of web log mining. To solve the defects in traditional session identification, an improved session identification algorithm was proposed. After identifying specific users, a great deal of frame pages were filtered, the relatively reasonable access time threshold for each page was made up according to contents of each page and all web structure and user´s session sets were identified by this threshold. Finally the algorithm was compared with the traditional methods of session identification by experiences, the higher rationality and effectiveness of it was proved.
Keywords :
Internet; data mining; Web log mining; Web structure; data processing; frame page; page threshold; session identification algorithm; Algorithm design and analysis; Conferences; HTML; Data preprocessing; Frame page; Session identification; Threshold; Web mining;
Conference_Titel :
Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5537-9
DOI :
10.1109/ICCSIT.2010.5564697