Title :
Improving neural-based classification of databases with overlapped classes: The case of star/galaxy segregation
Author :
Gómez-Gil, Pilar ; López-Cruz, Omar ; Cruz-Martínez, Ana Bertha
Author_Institution :
Opt. & Electron., Nat. Inst. of Astrophys., Tonantzintla, Mexico
Abstract :
There are many real-life classification problems where class overlapping severely limits the classification accuracy. In these situations is difficult to build automatic classifiers that obtain good generalization performance. An interesting case is found in the separation of stars and galaxies, which arises in galactic or extragalactic studies. There are many astronomical analysis packages which deal with this problem; for example, the very popular package SExtractor (Source Extractor) has incorporated a multi-layer perceptron (MLP) neural network classifier. We believe that SExtractor performance is suitable for improvement. In our way for building a better classifier, we analyzed the behavior of MLP-based classifiers for this kind of data. In this paper we present an experiment where, using WEKA, we have automatically selected the best characteristics to discriminate galaxies from stars and automatically selected the topology of a MLP that best defined the decision region. Our classifier obtained slightly better results than SExtractor when compared to classifications obtained by a human expert, using less computational resources that SExtractor. However, we conclude that more specific information about the problem needs to be used to build a better separator of star/galaxies.
Keywords :
astronomy computing; database management systems; galaxies; multilayer perceptrons; pattern classification; software packages; stars; MLP neural network classifier; SExtractor package; Source Extractor package; WEKA; astronomical analysis packages; automatic classifier; class overlapping; classification accuracy; multilayer perceptron; neural-based database classification; star-galaxy segregation; Artificial neural networks; Astronomy; Data mining; Databases; Software; Testing; Training; SExtractor; WEKA; classification using MLP; design of classifiers; feature selection; galaxy/star separation;
Conference_Titel :
Circuits and Systems (LASCAS), 2012 IEEE Third Latin American Symposium on
Conference_Location :
Playa del Carmen
Print_ISBN :
978-1-4673-1207-3
DOI :
10.1109/LASCAS.2012.6180340