Title :
Integrating e-commerce and data mining: architecture and challenges
Author :
Ansari, Suhail ; Kohavi, Ron ; Mason, Llew ; Zheng, Zijian
Author_Institution :
Blue Martini Software, San Mateo, CA, USA
Abstract :
We show that the e-commerce domain can provide all the right ingredients for successful data mining. We describe an integrated architecture for supporting this integration. The architecture can dramatically reduce the pre-processing, cleaning, and data understanding effort often documented to take 80% of the time in knowledge discovery projects. We emphasize the need for data collection at the application server layer (not the Web server) in order to support logging of data and metadata that is essential to the discovery process. We describe the data transformation bridges required from the transaction processing systems and customer event streams (e.g., clickstreams) to the data warehouse. We detail the mining workbench, which needs to provide multiple views of the data through reporting, data mining algorithms, visualization, and OLAP. We conclude with a set of challenges.
Keywords :
data mining; data visualisation; data warehouses; electronic commerce; information resources; meta data; transaction processing; OLAP; application server layer; clickstreams; customer event streams; data collection; data logging; data mining algorithms; data transformation bridges; data understanding; data warehouse; e-commerce domain; e-commerce/data mining integration; integrated architecture; knowledge discovery projects; metadata; mining workbench; multiple views; transaction processing systems; visualization; Bridges; Cleaning; Computer architecture; Data mining; Data visualization; Data warehouses; Fuels; User interfaces; Web pages; Web server
Conference_Title :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989497