DocumentCode :
814030
Title :
Learning deterministic finite automata with a smart state labeling evolutionary algorithm
Author :
Lucas, Simon M. ; Reynolds, T. Jeff
Author_Institution :
Dept. of Comput. Sci., Essex Univ., Colchester, UK
Volume :
27
Issue :
7
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
1063
Lastpage :
1074
Abstract :
Learning a deterministic finite automaton (DFA) from a training set of labeled strings is a hard task that has been much studied within the machine learning community. It is equivalent to learning a regular language by example and has applications in language modeling. In this paper, we describe a novel evolutionary method for learning DFA that evolves only the transition matrix and uses a simple deterministic procedure to optimally assign state labels. We compare its performance with the evidence driven state merging (EDSM) algorithm, one of the most powerful known DFA learning algorithms. We present results on random DFA induction problems of varying target size and training set density. We also study the effects of noisy training data on the evolutionary approach and on EDSM. On noise-free data, we find that our evolutionary method outperforms EDSM on small sparse data sets. In the case of noisy training data, we find that our evolutionary method consistently outperforms EDSM, as well as other significant methods submitted to two recent competitions.
Keywords :
deterministic automata; evolutionary computation; finite automata; learning (artificial intelligence); matrix algebra; deterministic finite automata learning; evidence driven state merging algorithm; language modeling; machine learning community; smart state labeling evolutionary algorithm; transition matrix; Doped fiber amplifiers; Evolutionary computation; Labeling; Learning automata; Machine learning; Merging; Recurrent neural networks; Sampling methods; Testing; Training data; Index Terms- Grammatical inference; evolutionary algorithm.; finite state automata; random hill climber; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Statistical; Natural Language Processing; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Sequence Alignment; Sequence Analysis; Signal Processing, Computer-Assisted;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2005.143
Filename :
1432740
Link To Document :
بازگشت