DocumentCode :
2683501
Title :
A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph
Author :
Nghia, Hoang Trong ; Phuc, Do
Author_Institution :
Dept. of Inf. Technol., Univ. of Natural Sci., Ho Chi Minh City, Vietnam
fYear :
2009
fDate :
13-17 July 2009
Firstpage :
1
Lastpage :
4
Abstract :
This paper introduces a new approach to restoring the accents of Vietnamese text. Given a Vietnamese text whose accents have been lost, our goal is to find the recovered text with the highest lexical probability. Our dynamic programming approach has three steps. First, we build a language model for Vietnamese as a lexical database that assigns lexical probabilities to Vietnamese sentences. Second, we construct a map of literal translations of Vietnamese words to restrict the search space. Finally, we apply dynamic programming as a search engine to find the most probable sentence. We also use a co-occurrence graph to increase the accuracy of selection. Experimental results show that the average accuracy of our approach is about 93%-94%.
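The search step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the form of the language model, so this sketch assumes a bigram model, reads the "map of literal translations" as a map from each unaccented syllable to its possible accented forms, and leaves out the co-occurrence graph entirely. The names CANDIDATES, BIGRAM_LOGP, UNSEEN_LOGP, and restore_accents, and all probability values, are hypothetical.

```python
import math

# Hypothetical candidate map: unaccented syllable -> possible accented forms.
# It restricts the search space to a few options per token.
CANDIDATES = {
    "toi": ["tôi", "tối", "tới"],
    "la": ["là", "lá", "la"],
    "sinh": ["sinh"],
    "vien": ["viên", "viện"],
}

# Hypothetical bigram log-probabilities (assumed values, not from the paper).
BIGRAM_LOGP = {
    ("<s>", "tôi"): math.log(0.5),
    ("tôi", "là"): math.log(0.6),
    ("là", "sinh"): math.log(0.3),
    ("sinh", "viên"): math.log(0.8),
    ("sinh", "viện"): math.log(0.05),
}
UNSEEN_LOGP = math.log(1e-4)  # flat penalty for word pairs not in the table


def score(prev, word):
    """Log-probability of `word` following `prev` under the toy model."""
    return BIGRAM_LOGP.get((prev, word), UNSEEN_LOGP)


def restore_accents(tokens):
    """Viterbi-style dynamic programming over candidate accented forms."""
    # best[w] = (log-prob of the best path ending in w, that path)
    best = {"<s>": (0.0, [])}
    for tok in tokens:
        options = CANDIDATES.get(tok, [tok])  # unknown tokens pass through
        nxt = {}
        for word in options:
            # Keep only the best-scoring way of reaching `word`.
            nxt[word] = max(
                ((lp + score(prev, word), path + [word])
                 for prev, (lp, path) in best.items()),
                key=lambda item: item[0],
            )
        best = nxt
    # Return the path with the highest total log-probability.
    return max(best.values(), key=lambda item: item[0])[1]


restored = restore_accents(["toi", "la", "sinh", "vien"])
```

Because only the best path into each candidate is kept at every position, the work grows linearly with sentence length rather than with the number of possible accent combinations.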
Keywords :
dynamic programming; graph theory; natural languages; probability; text analysis; Vietnamese texts; accent restoration; co-occurrence graph; dynamic programming; lexical database; lexical probability; recovered text; Context modeling; Databases; Dynamic programming; Information systems; Information technology; Natural languages; Search engines; Writing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computing and Communication Technologies, 2009. RIVF '09. International Conference on
Conference_Location :
Da Nang
Print_ISBN :
978-1-4244-4566-0
Electronic_ISBN :
978-1-4244-4568-4
Type :
conf
DOI :
10.1109/RIVF.2009.5174609
Filename :
5174609