Title :
On generating synthetic database for classification
Author :
Lu, Hongjun ; Sung, Sam Y. ; Lu, Ying
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
Abstract :
To evaluate the performance of the many classifiers that have been developed and are still being developed, properly generating data that models the real world is an important preprocessing step. In real-world situations, data often follow common distributions. This paper explains in detail the methods and distribution models adopted to generate the data.
Keywords :
Poisson distribution; database management systems; database theory; exponential distribution; knowledge acquisition; pattern classification; classification; common distributions; distribution models; realistic world; synthetic database generation; Algorithm design and analysis; Benchmark testing; Classification algorithms; Data mining; Database systems; Distributed databases; Electronic mail; Information systems; Measurement; Mining industry
Conference_Title :
Systems, Man, and Cybernetics, 1996., IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-3280-6
DOI :
10.1109/ICSMC.1996.571294