DocumentCode :
2501064
Title :
Some observations about Thai synonymous compounds from the BEST 2009 corpus
Author :
Phaholphinyo, Sitthaa ; Purodakananda, Sumonmas ; Kriengket, Kanyanut ; Kosawat, Krit
Author_Institution :
Human Language Technol. Lab., Nat. Electron. & Comput. Technol. Center, Pathum Thani, Thailand
fYear :
2009
fDate :
20-22 Oct. 2009
Firstpage :
194
Lastpage :
199
Abstract :
This research aims to analyse Thai synonymous compounds appearing in the BEST 2009 corpus in order to find out their structure. We selected only synonymous compound words which appear 100 times and more to analyse in 3 aspects: number of constituents, parts of speech and formation. The results show that Thai synonymous compounds comprised of 1-4 morphemes, 2-4 and 6 syllables and are categorized into 4 parts of speech and 16 POS structures. Moreover, the verb + verb structure and the synonymous compounds with same consonant sound and identical meaning appear most frequently. This research can be applied to a synonymous compound extracting machine to produce a synonymous compound dictionary.
Keywords :
natural languages; BEST 2009 corpus; Thai synonymous compounds; synonymous compound dictionary; synonymous compound extracting machine; Data analysis; Data mining; Dictionaries; Encyclopedias; Error analysis; Frequency conversion; Natural language processing; Rivers; Software standards; Speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
Type :
conf
DOI :
10.1109/SNLP.2009.5340920
Filename :
5340920
Link To Document :
بازگشت