DocumentCode :
3703590
Title :
Ensemble of deep long short term memory networks for labelling origin of replication sequences
Author :
Urminder Singh;Sucheta Chauhan;A. Krishnamachari;Lovekesh Vig
Author_Institution :
School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
fYear :
2015
Firstpage :
1
Lastpage :
7
Abstract :
Advancement in sequence data generation technologies are churning out voluminous omics data and posing a massive challenge to annotate the biological functional features. Sequence data from the well studied model organism Saccharomyces cerevisiae has been commonly used to test and validate in silico prediction methods. DNA replication is a critical step in the cellular process and the sequence location where this process originates in the genomic landscape is generally referred as origin of replication. In this paper we investigate the application bidirectional Long Short Term (LSTM) Networks to predict origin of replication sequences. Long Short Term Memory (LSTM) networks have recently been shown to yield state of the art performance in speech recognition, and music generation. These networks are capable of learning long term patterns via the use of multiplication gates. This paper utilizes Deep bidirectional LSTM for prediction of origin of replication sequences belonging to the organism Saccharomyces cerevisiae. Results demonstrate that LSTMs outperform the commonly used machine learning classifiers such as Support Vector Machine (SVM), Random Forest (RF), Artificial Neural Network (ANN), and Hidden Markov Model (HMM). An important additional advantage of LSTMs is that they work directly on the sequences and obviate the need for hand coded features.
Keywords :
"Logic gates","Genomics","Bioinformatics","DNA","Biological cells","Computer architecture","Hidden Markov models"
Publisher :
ieee
Conference_Titel :
Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on
Print_ISBN :
978-1-4673-8272-4
Type :
conf
DOI :
10.1109/DSAA.2015.7344871
Filename :
7344871
Link To Document :
بازگشت