DocumentCode :
3590868
Title :
Corpus based learning of stochastic context-free grammar combined with hidden Markov models for tRNA modelling
Author :
García-Gómez, Juan Miguel ; Benedi, Jose Miguel
Author_Institution :
Informatica Medica-BET, Politecnico de Valencia, Spain
Volume :
1
fYear :
2004
Firstpage :
2785
Lastpage :
2788
Abstract :
The tRNA molecule has a well-known secondary structure, into which it folds through the pairing of distant nucleotides. This paper presents a syntactic pattern recognition methodology for modelling tRNA secondary structure using stochastic context-free grammars. To learn the models, the structural regions (paired nucleotides) are learned from categorized samples with fully labelled trees using a Corpus based estimation algorithm. The nonstructural regions are modelled by hidden Markov models, which are then transformed into stochastic regular grammars so that they can be merged with the structural regions. Tests with positive and negative samples, in comparison with Sakakibara, achieved a sequence error rate (SER) of 1.81%, a precision of 98.43% and a recall of 100%, and an SER of 100% on the negative test. The Corpus based algorithm is computationally efficient and requires fewer training samples to converge to the correct model of the tRNA secondary structure.
Keywords :
biology computing; context-free grammars; hidden Markov models; inference mechanisms; learning (artificial intelligence); macromolecules; molecular biophysics; molecular configurations; organic compounds; pattern recognition; physiological models; Corpus based estimation algorithm; Corpus based learning; full labelled tree; grammatical inference; hidden Markov models; language modelling; nucleotides; stochastic context-free grammar; stochastic regular grammars; syntactic pattern recognition; tRNA folding; tRNA modelling; tRNA second structure; Arm; Context modeling; Error analysis; Hidden Markov models; Inference algorithms; Pattern recognition; Probability; RNA; Stochastic processes; Testing; RNA; grammatical inference; language modelling; stochastic context-free grammars; syntactic pattern recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE
Print_ISBN :
0-7803-8439-3
Type :
conf
DOI :
10.1109/IEMBS.2004.1403796
Filename :
1403796