DocumentCode
2582563
Title
Density Estimation Technique for Data Stream Classification
Author
Kerdprasop, Nittaya ; Kerdprasop, Kittisak
Author_Institution
Sch. of Comput. Eng., Suranaree Univ. of Technol.
fYear
0
fDate
0-0 0
Firstpage
662
Lastpage
666
Abstract
Density estimation is an important pre-processing step in the problem of data stream classification in which the number of data is overwhelming and the exact data distribution is unknown. We simplify the problem by employing a statistical sampling technique to obtain an approximate solution. With the proposed method, an unbounded large data set can be sampled in a number of random configurations, and that data can be used to describe the data set as a whole. The efficiency of the method depends largely on the ability to draw samples effectively which in turn depends on how close we can estimate the target density. We use finite mixture models to represent the probability density functions of the data stream. Then, we apply the EM algorithm twice to learn the model parameters. The efficiency of our estimation technique has been shown in the experimental results
Keywords
data analysis; expectation-maximisation algorithm; pattern classification; probability; sampling methods; EM algorithm; data stream classification; density estimation; finite mixture models; probability density functions; statistical sampling; Algorithm design and analysis; Data analysis; Data engineering; Data mining; Distributed computing; Knowledge engineering; Parameter estimation; Performance analysis; Probability density function; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2006. DEXA '06. 17th International Workshop on
Conference_Location
Krakow
ISSN
1529-4188
Print_ISBN
0-7695-2641-1
Type
conf
DOI
10.1109/DEXA.2006.49
Filename
1698426
Link To Document