Title :
Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm
Author :
Arun Kumar;Llu?s Padr?;Antoni Oliver
Author_Institution :
Universitat Oberta de Catalunya, Barcelona, Spain
Abstract :
In this paper we describe a method of morphologically segment highly agglutinating and inflectional languages from the Dravidian family. We use the nested Pitman-Yor process to segment long agglutinated words into their basic components, and use a corpus based morpheme induction algorithm to perform morpheme segmentation. We test our method on two languages, Malayalam and Kannada and compare the results with Morfessor-baseline.
Keywords :
"Computational modeling","Morphology","Biological system modeling"
Conference_Titel :
Asian Language Processing (IALP), 2015 International Conference on
Print_ISBN :
978-1-4673-9595-3
DOI :
10.1109/IALP.2015.7451528