Title :
A Study on the Mining Algorithm of Fast Association Rules for the XML Data
Author_Institution :
Coll. of Comput. Sci. & Inf. Eng., Zhejiang Gongshang Univ., Hangzhou
fDate :
Aug. 29 2008-Sept. 2 2008
Abstract :
It presents an efficient mining algorithm FreqtTree for discovering all frequent patterns from XML data, and then considers mining global frequent patterns from XML data in distributed environment in this paper. First of all, the XML files are transferred to DOM tree, and then it mines all the frequent patterns from the DOM tree. It´s a high efficient algorithm because it adopts the right extension technology and scans the DOM tree only one time. After that, it describes the distributed association rule data mining algorithm DFreqtTree based on DOM tree. At last, this algorithm is implemented and analyzed by Java language.
Keywords :
Java; XML; data mining; DFreqtTree; Java language; XML data; data mining algorithm; distributed association rule; distributed environment; fast association rules; global frequent patterns mining; mining algorithm; Algorithm design and analysis; Association rules; Computer science; Data engineering; Data mining; Educational institutions; Information technology; Java; Pattern matching; XML;
Conference_Titel :
Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3308-7
DOI :
10.1109/ICCSIT.2008.89