DocumentCode :
2074421
Title :
Learning local languages and its application to protein /spl alpha/-chain identification
Author :
Yokomori, Takashi ; Ishida, Nobuyuki ; Kobayashi, Satoshi
Author_Institution :
Dept. of Comput Sci. & Inf. Math., Univ. of Electro-Commun., Chofu, Japan
Volume :
5
fYear :
1994
fDate :
4-7 Jan. 1994
Firstpage :
113
Lastpage :
122
Abstract :
Concerns an efficient algorithm for learning in the limit a special type of regular language called a locally testable language from positive data, and its application to identifying the protein /spl alpha/-chain region in amino acid sequences. First, we present a linear-time algorithm that, given a locally testable language, learns (identifies) its deterministic finite state automaton in the limit from only positive data. This provides a practical and efficient learning method for a specific domain of symbolic analysis. We then describe several experimental results using the learning algorithm. Following a theoretical observation which strongly suggests that a certain type of amino acid sequence can be expressed by a locally testable language, we apply the learning algorithm to identifying the protein /spl alpha/-chain region in amino acid sequences for hemoglobin. Experimental scores show an overall success rate of 95% correct identification for positive data and 96% for negative data.<>
Keywords :
biology computing; computational complexity; deterministic automata; finite automata; formal languages; learning (artificial intelligence); macromolecular configurations; proteins; amino acid sequences; deterministic finite state automaton; hemoglobin; learning method; linear time algorithm; locally testable language; negative data; positive data; protein /spl alpha/-chain identification; regular language; symbolic analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1994. Proceedings of the Twenty-Seventh Hawaii International Conference on
Conference_Location :
Wailea, HI, USA
Print_ISBN :
0-8186-5090-7
Type :
conf
DOI :
10.1109/HICSS.1994.323560
Filename :
323560
Link To Document :
بازگشت