• DocumentCode
    2418513
  • Title

    Discourse marker generation and syntactic aggregation in Bengali text generation

  • Author

    Das, Sumit ; Basu, Anupam ; Sarkar, Sudeshna

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Kharagpur, India
  • fYear
    2010
  • fDate
    3-4 April 2010
  • Firstpage
    305
  • Lastpage
    311
  • Abstract
    In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.
  • Keywords
    natural language processing; text analysis; Bengali text generation; discourse marker generation; elementary text spans; rule based approach; syntactic aggregation; text aggregation; text generation; user based evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Students' Technology Symposium (TechSym), 2010 IEEE
  • Conference_Location
    Kharagpur
  • Print_ISBN
    978-1-4244-5975-9
  • Electronic_ISBN
    978-1-4244-5974-2
  • Type

    conf

  • DOI
    10.1109/TECHSYM.2010.5469163
  • Filename
    5469163