DocumentCode :
3582808
Title :
A nondeterministic approach to infer context free grammar from sequence
Author :
Yuan Li ; Chen, Jim X.
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
fYear :
2014
Firstpage :
1
Lastpage :
9
Abstract :
Grammar induction has received considerable attention over the past decades because of its practical and theoretical impact on data compression, pattern discovery, and computation theory. A number of grammar induction algorithms for a given sequence have been introduced. Most existing work on learning a grammar from a given sequence is based on a deterministic approach, and these deterministic approaches can be categorized as greedy heuristics. In addition, many different grammars can be learned from a given sequence. The smallest grammar problem was defined to evaluate the different grammars learned from a given sequence by different algorithms, and this problem has been proved NP-hard. In this work, we introduce a nondeterministic approach, based on a genetic algorithm, to grammar induction for a given sequence. We demonstrate that our grammar induction algorithm can effectively identify a smaller grammar than a well-known grammar induction algorithm. The experimental results presented illustrate that our approach and algorithm are feasible for difficult problems such as identifying patterns in DNA sequences.
Keywords :
computational complexity; context-free grammars; genetic algorithms; learning (artificial intelligence); sequences; DNA sequence; NP-hard problem; computation theory; context free grammar; data compression; deterministic approaches; genetic algorithm; grammar induction algorithm; greedy heuristics; learning grammar; nondeterministic approach; pattern discovery; Algorithm design and analysis; Data compression; Genetic algorithms; Grammar; Merging; Sociology; Statistics; Genetic Algorithm; Grammar Induction; Nondeterministic approach; Pattern;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Wavelet Active Media Technology and Information Processing (ICCWAMTIP), 2014 11th International Computer Conference on
Print_ISBN :
978-1-4799-7207-4
Type :
conf
DOI :
10.1109/ICCWAMTIP.2014.7073350
Filename :
7073350