DocumentCode
174849
Title
An Efficient Algorithm for Hierarchical Classification of Protein and Gene Functions
Author
Fabris, Fabio ; Freitas, Alex A.
Author_Institution
Sch. of Comput., Univ. of Kent, Canterbury, UK
fYear
2014
fDate
1-5 Sept. 2014
Firstpage
64
Lastpage
68
Abstract
The classification of protein and gene functions is a complex problem that is becoming more relevant as the number of sequenced genes and proteins increases. This work presents a modified version of the Extended Local Hierarchical Naive Bayes algorithm, which exploits the requirements of the original algorithm (single-path, mandatory-leaf-prediction hierarchical classification problems in tree-structured class hierarchies) to greatly improve classification run-time. We show that, considering 18 hierarchical classification datasets, the modified algorithm yields equivalent predictive performance and significantly improves run-time in the training and prediction phases.
Keywords
Bayes methods; biology computing; genetics; pattern classification; proteins; classification run-time; extended local hierarchical naive Bayes algorithm; gene function classification; hierarchical classification datasets; prediction phases; predictive performance; protein function classification; sequenced genes; single-path mandatory-leaf-prediction hierarchical classification problems; training phases; tree-structured class hierarchies; Bioinformatics; Data mining; Prediction algorithms; Probability; Proteins; Time complexity; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications (DEXA), 2014 25th International Workshop on
Conference_Location
Munich
ISSN
1529-4188
Print_ISBN
978-1-4799-5721-7
Type
conf
DOI
10.1109/DEXA.2014.29
Filename
6974828
Link To Document