• DocumentCode
    3722786
  • Title

    Improving Vietnamese Sentence Compression by Segmenting Meaning Chunks

  • Author

    Nhi-Thao Tran;Van-Giau Ung;An-Vinh Luong;Minh-Quoc Nghiem;Ngan Luu-Thuy Nguyen

  • Author_Institution
    Fac. of Inf. Technol., Ho Chi Minh City Univ. of Sci., Ho Chi Minh City, Vietnam
  • fYear
    2015
  • Firstpage
    320
  • Lastpage
    323
  • Abstract
    This paper proposes an approach for sentence compression that only requires the part-of-speech information. The method is based on an observation of the human compression: adjacent words which form a meaning chunk usually are removed or retained together. We incorporate meaning chunk as a feature for a CRF-based sequence labeling system. Experimental results on English and Vietnamese compression datasets show that the proposed approach achieved better performance than the state-of-the-art systems.
  • Keywords
    "Bayes methods","Cities and towns","Labeling","Tagging","Manuals","Information technology","Biological system modeling"
  • Publisher
    ieee
  • Conference_Titel
    Knowledge and Systems Engineering (KSE), 2015 Seventh International Conference on
  • Type

    conf

  • DOI
    10.1109/KSE.2015.74
  • Filename
    7371804