• DocumentCode
    3545578
  • Title

    A Grammatically Structured Noun Phrase Extractor for Vietnamese

  • Author

    Tuan-nguyen, Hoai-Duc ; Ho, Bao-Quoc ; Bui, Tuan-Dung ; Hoang, Minh-Chau

  • Author_Institution
    Dept. of IS, Univ. of Natural Sci., Ho Chi Minh City, Vietnam
  • fYear
    2012
  • fDate
    Feb. 27 2012-March 1 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.
  • Keywords
    XML; grammars; natural language processing; tree data structures; GSNP extraction; Java class; NLP system; Vietnamese document; XML document; grammar structure; grammatically structured noun phrase extractor; natural language processing; Educational institutions; Grammar; Learning systems; Measurement; Natural language processing; Tagging; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2012 IEEE RIVF International Conference on
  • Conference_Location
    Ho Chi Minh City
  • Print_ISBN
    978-1-4673-0307-1
  • Type

    conf

  • DOI
    10.1109/rivf.2012.6169837
  • Filename
    6169837