DocumentCode :
3088406
Title :
Data Mixed-Extraction Strategy Based on the Time Characteristics in CDW
Author :
Zhang, Li-Na ; Liu, Jie ; Zhang, Yue
Author_Institution :
Vocational & Tech. Coll., Shenyang Normal Univ., Shenyang, China
fYear :
2010
fDate :
17-19 Sept. 2010
Firstpage :
1129
Lastpage :
1131
Abstract :
There is a great deal of general information and behavior information of customers stored in clickstream data warehouse, so it has lots of data sources. Data extraction technologies such as traditional web server logs and packet sniffer have many inadequacies in the extraction efficiency and accuracy. According to the actual needs of OLAP and DM, this dissertation proposes a hybrid data extraction strategy based on time characteristics in the server layer, which makes up the shortcomings of clickstream data such as data incompleteness by combining clickstream data and external data sources.
Keywords :
Internet; data mining; data warehouses; CDW; DM; OLAP; Web server logs; clickstream data warehouse; data incompleteness; data mining; data mixed-extraction strategy; dissertation proposes; packet sniffer; Accuracy; Data mining; Data warehouses; IP networks; Time domain analysis; Web server; Clickstream; Data mixed-extraction strategy; Data warehouse; Time characteristics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pervasive Computing Signal Processing and Applications (PCSPA), 2010 First International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-8043-2
Electronic_ISBN :
978-0-7695-4180-8
Type :
conf
DOI :
10.1109/PCSPA.2010.278
Filename :
5635894
Link To Document :
بازگشت