Title :
Extracting Phrases in Vietnamese Document for Summary Generation
Author :
Le, Huong Thanh ; Sam, Rathany Chan ; Nguyen, Phuc Trong
Author_Institution :
Sch. of Inf. & Commun. Technol., Hanoi Univ. of Technol., Hanoi, Vietnam
Abstract :
This paper describes an approach to Vietnamese text summarization, concentrated on the discourse structure of the text. Based on characteristics of Vietnamese, we propose rules for segmenting text into elementary discourse units (edus) and for recognizing discourse relations between textual spans. The score of an edu is computed based on the discourse tree. The edus with highest scores are chosen to put in the summary. Experiments show that this method can give promising results.
Keywords :
computational linguistics; natural language processing; text analysis; Vietnamese; discourse tree; elementary discourse unit; phrase extraction; text summarization; textual span; Barium; Pragmatics; Presses; Satellites; Semantics; Syntactics; Text recognition; Vietnamese; discourse structure; rhetorical relation; text summarization;
Conference_Titel :
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9063-9
DOI :
10.1109/IALP.2010.8