Title :
Research on a Scalable Parallel Data Mining Algorithm
Author :
Wang, JinLin ; Chen, Xi ; Zhou, Kefa
Abstract :
Sequential pattern mining is an active field in the domain of knowledge discovery and has been widely studied for over a decade by data mining researchers. More and more, with the constant progress in hardware and software technologies, real-world applications like network monitoring systems or sensor grids generate huge amount of streaming data. These works need an efficient and scalable parallel algorithm. On the basis of the widespread problem in current sequential pattern data mining algorithm and researching the data mining algorithm of serial sequential pattern, this paper proposes sequential patterns based and projection database based algorithm for scalable parallel sequential patterns data mining algorithm. Through theoretical analysis and experimental verification, the parallel data mining algorithm can well reduce the computational and spatial complexity and improve the efficiency of data mining in massive data circumstances.
Keywords :
computational complexity; data mining; parallel algorithms; computational complexity; knowledge discovery; network monitoring systems; parallel algorithm; projection database based algorithm; scalable parallel data mining algorithm; sensor grids; sequential pattern mining; spatial complexity; Algorithm design and analysis; Application software; Concurrent computing; Data mining; Hardware; Mesh generation; Monitoring; Parallel algorithms; Sensor systems and applications; Spatial databases; data mining; parallel algorithm; sequential patterns;
Conference_Titel :
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-5209-5
Electronic_ISBN :
978-0-7695-3769-6
DOI :
10.1109/NCM.2009.330