DocumentCode :
2764471
Title :
Regularization of sequence data for machine learning
Author :
Bai, B. ; Kremer, S.C.
Author_Institution :
Sch. of Comput. Sci., Univ. of Guelph, Guelph, ON, Canada
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
19
Lastpage :
25
Abstract :
We examine the problem of classifying biological sequences, and in particular the challenge of generalizing results to novel input data. We observe that the high-dimensionality of sequence data representations results in an extremely sparsely populated input space. This motivates a need for regularization (a form of inductive bias), in order to achieve generalization. We discuss regularization in the context of regular neural networks, deep belief networks and support vector machines, and provide experimental results for these architectures. Our results support the importance of using an effective regularization method and identify which methods work well on a real-world dataset.
Keywords :
DNA; belief networks; bioinformatics; learning (artificial intelligence); neural nets; support vector machines; biological sequences; deep belief network; machine learning; neural network; sequence data regularization; sequence data representation; support vector machine; Complexity theory; DNA; Kernel; Learning systems; Machine learning; Support vector machines; Training; DNA barcoding; deep architecture; generalization; machine learning; neural network; non-monophyletic species; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1612-6
Type :
conf
DOI :
10.1109/BIBMW.2011.6112350
Filename :
6112350
Link To Document :
بازگشت