DocumentCode
2418513
Title
Discourse marker generation and syntactic aggregation in Bengali text generation
Author
Das, Sumit ; Basu, Anupam ; Sarkar, Sudeshna
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Kharagpur, India
fYear
2010
fDate
3-4 April 2010
Firstpage
305
Lastpage
311
Abstract
In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.
Keywords
natural language processing; text analysis; Bengali text generation; discourse marker generation; elementary text spans; rule based approach; syntactic aggregation; text aggregation; text generation; user based evaluation;
fLanguage
English
Publisher
ieee
Conference_Titel
Students' Technology Symposium (TechSym), 2010 IEEE
Conference_Location
Kharagpur
Print_ISBN
978-1-4244-5975-9
Electronic_ISBN
978-1-4244-5974-2
Type
conf
DOI
10.1109/TECHSYM.2010.5469163
Filename
5469163
Link To Document