Title :
Data distributions automatic identification based on SOM and support vector machines
Author :
Zhu, Jia-yuan ; Zhang, Heng-xi ; Guo, Ji-lian ; Feng, Jing-lei
Author_Institution :
Dept. of Aeronaut. Mech. Eng., Air Force Eng. Univ., Xi´´an, China
Abstract :
It is very important to identify probability distributions fast and efficiently in data analysis. The paper analyzes data distributions automatic identification using a combined structure mode via self-organizing map and support vector machines. First, the paper sets up data distributions identification training sets, which are based on summary statistics including kurtosis, skewness, quantile and cumulative probability. Then, different data distributions are clustered using a self-organizing map. Furthermore, the clusters are learned and classified respectively using support vector machines. Finally, identification of random data distribution time series is tested in combined structure mode. The results indicate that the approach of the paper is feasible and efficient for automatically identifying data distributions in comparison with other methods.
Keywords :
learning automata; pattern classification; pattern clustering; probability; self-organising feature maps; statistical analysis; time series; SOM; automatic identification; clustering; combined structure mode; cumulative probability; data distributions; kurtosis; pattern identification; probability distributions; quantile probability; self-organizing map; skewness; support vector machines; training sets; Data analysis; Learning systems; Machine learning; Mechanical engineering; Neural networks; Probability distribution; Statistical distributions; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1176770