Title :
Using genetic algorithm for Persian grammar induction
Author :
Arabsorkhi, Mohsen ; Faili, Hesham ; Jahroumi, Mansoor Zolghadri
Author_Institution :
Comput. Eng. Dept., Islamic Azad Univ. of Saveh, Saveh, Iran
Abstract :
Most of efficient computational approaches in NLP tasks are supervised methods which need annotated corpora. But the lack of supervised data in Persian encourages researchers to increase their interests and efforts on unsupervised and semi-supervised approaches. This paper presents a novel semi-supervised approach which called Genetic-based inside-outside (GIO), for Persian grammar inference for inducing a grammar model in a PCFG formalism. GIO is an extension of the inside-outside algorithm enriched by some notions of genetic algorithm. In pure genetic algorithm for grammar induction, randomly generated initial population make it computationally expensive, so we used inside-outside algorithm to generate initial population. Our experiments show that our approach´s result is better than other applied methods for Persian grammar induction.
Keywords :
computational linguistics; genetic algorithms; grammars; natural language processing; PCFG formalism; Persian grammar induction; Persian grammar inference; genetic algorithm; genetic-based inside-outside; initial population random generation; natural language processing; semi-supervised approach; Computer science; Genetic algorithms; Genetic engineering; Hidden Markov models; Induction generators; Inference algorithms; Iterative methods; Natural languages; Statistical analysis; Tagging; Grammar induction; Persian grammar; genetic algorithm; inside-outside algorithm;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-4538-7
Electronic_ISBN :
978-1-4244-4540-0
DOI :
10.1109/NLPKE.2009.5313851