DocumentCode
3457754
Title
The FP-Tree Algorithm Used for Data Stream
Author
Tu, Qiang ; Lu, Jian-Feng ; Tang, Jiu-Bin ; Yang, Jing-Yu
Author_Institution
Dept. Of Comput. Sci., Nanjing Univ. Of Sci. & Technol., Nanjing, China
fYear
2010
fDate
21-23 Oct. 2010
Firstpage
1
Lastpage
5
Abstract
FP-growth is a classical algorithm in frequent pattern mining, which is often used in static data mining. Some researches have been done on using FP-growth algorithm to analyze streaming data. However, the double-scan-of-database manner in FP-tree creation is a serious bottleneck in streaming data analysis. Sliding window technique could solve this problem in certain degree, but it still can lead to the inaccuracy of FP-tree creation, which may impact the consequent data mining. In this paper, a new FP-tree algorithm is presented for streaming data, which creates the FP-tree by a single-pass scanning (SPSFP) throughout the database. Compared with the traditional FP-tree creation, the new method scans the database only once and doesn´t need to store the whole data set into memory, which not only saves the memory space but also makes it possible to mine frequent pattern accurately in streaming data. Furthermore, the time cost of the new algorithm is almost equivalent to the traditional one.
Keywords
data analysis; data mining; database management systems; trees (mathematics); FP growth algorithm; FP tree algorithm; data streaming analysis; double-scan-of-database; pattern mining; single pass scanning; sliding window technique; static data mining; Algorithm design and analysis; Data mining; Databases; Electronic mail; Information science; Next generation networking; Presses;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (CCPR), 2010 Chinese Conference on
Conference_Location
Chongqing
Print_ISBN
978-1-4244-7209-3
Electronic_ISBN
978-1-4244-7210-9
Type
conf
DOI
10.1109/CCPR.2010.5659233
Filename
5659233
Link To Document