DocumentCode
3110236
Title
Gene-finding as an Attribute Selection Task
Author
Borges, Helyane Bronoski ; Nievola, Julio Cesar
Author_Institution
Pontificia Univ. Catolica do Parana, Curitiba
fYear
2007
fDate
11-13 July 2007
Firstpage
537
Lastpage
542
Abstract
For data miners, bioinformatics pose a most demanding challenge than only creating efficient algorithms. They should work with databases that are more "horizontal" than "vertical", as the data consist of a few samples of a large (sometimes huge) number of attributes in the case of micro-arrays. More important is the fact that there is a priori biological knowledge saying that only a few genes are normally linked to each characteristic exhibited by the individual. It allows one to use Attribute Selection to determine which attributes are more likely to induce the observable characteristic. In this paper a study on many configurations of attribute selection schemes is made on two typical bioinformatics datasets. The results show that sequential subset generation guarantees better results and reiterates the use of the wrapper approach to achieve better classification, despite its running time being larger than the filter approach.
Keywords
DNA; biology computing; data mining; genetics; pattern classification; DNA; attribute selection scheme; bioinformatics; data mining; genetics; microarray technology; pattern classification; Bioinformatics; Cancer; DNA; Data mining; Databases; Diseases; Filters; Genetics; Machine learning; Malignant tumors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Science, 2007. ICIS 2007. 6th IEEE/ACIS International Conference on
Conference_Location
Melbourne, Qld.
Print_ISBN
0-7695-2841-4
Type
conf
DOI
10.1109/ICIS.2007.104
Filename
4276437
Link To Document