• DocumentCode
    120890
  • Title
    An approach to automatic text summarization using WordNet
  • Author
    Pal, Alok Ranjan; Saha, D.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Coll. of Eng. & Manage., Kolaghat, India
  • fYear
    2014
  • fDate
    21-22 Feb. 2014
  • Firstpage
    1169
  • Lastpage
    1173
  • Abstract
    Text summarization is the procedure by which the significant portions of a text are retrieved. Most approaches perform summarization based on hand-tagged rules, such as the format of a sentence, the position of a sentence in the text, or the frequency of a few particular words in a sentence. However, because input sources differ, these predefined constraints greatly affect the result. The proposed approach performs the summarization task using an unsupervised learning methodology. The importance of a sentence in an input text is evaluated with the help of the Simplified Lesk algorithm, with WordNet used as an online semantic dictionary. First, the approach evaluates the weight of each sentence of a text separately using the Simplified Lesk algorithm and arranges the sentences in decreasing order of weight. Next, according to the given percentage of summarization, a particular number of sentences is selected from that ordered list. The proposed approach gives its best results up to 50% summarization of the original text and gives satisfactory results even up to 25% summarization of the original text.
  • Keywords
    database management systems; information retrieval; text analysis; unsupervised learning; automatic text summarization; hand tagged rules; online semantic dictionary WordNet; Simplified Lesk algorithm; text retrieval; unsupervised learning methodology; Abstracts; Conferences; Data mining; Dictionaries; Presses; Semantics; Abstract; Automatic Text Summarization; Extract; Lesk algorithm; WordNet
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advance Computing Conference (IACC), 2014 IEEE International
  • Conference_Location
    Gurgaon
  • Print_ISBN
    978-1-4799-2571-1
  • Type
    conf
  • DOI
    10.1109/IAdCC.2014.6779492
  • Filename
    6779492
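
The procedure outlined in the abstract (weight each sentence with Simplified Lesk, sort by decreasing weight, keep a given percentage) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The toy gloss table `TOY_GLOSSES` stands in for WordNet, and the stopword list, the tokenizer, the exact weighting rule (summing each word's best gloss/context overlap), and the restoring of selected sentences to their original order are all assumptions made for this example.

```python
import math
import re

# Hypothetical toy gloss table standing in for WordNet: word -> sense glosses.
TOY_GLOSSES = {
    "bank": [
        "a financial institution that accepts deposits and lends money",
        "sloping land beside a river or lake",
    ],
    "money": ["a medium of exchange such as coins and notes held in a bank"],
    "river": ["a large natural stream of water flowing between one bank and another"],
}

# Assumed stopword list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "the", "of", "and", "or", "in", "to", "is",
             "that", "as", "such", "on", "by", "it"}


def tokenize(text):
    """Lowercase, keep alphabetic tokens, drop stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def sentence_weight(sentence, glosses):
    """Sum, over the words of the sentence, the best gloss/context overlap.

    For each word, Simplified Lesk compares every sense gloss with the
    rest of the sentence (the context) and keeps the largest overlap.
    """
    context = set(tokenize(sentence))
    weight = 0
    for word in context:
        others = context - {word}
        overlaps = [len(set(tokenize(g)) & others) for g in glosses.get(word, [])]
        weight += max(overlaps, default=0)
    return weight


def summarize(text, fraction, glosses=TOY_GLOSSES):
    """Keep roughly `fraction` of the sentences, chosen by decreasing weight."""
    sentences = split_sentences(text)
    keep = max(1, math.ceil(len(sentences) * fraction))
    ranked = sorted(range(len(sentences)),
                    key=lambda i: sentence_weight(sentences[i], glosses),
                    reverse=True)
    # Assumption: present the selected sentences in their original order.
    return [sentences[i] for i in sorted(ranked[:keep])]
```

Calling `summarize(text, 0.5)` would correspond to the 50% summarization level mentioned in the abstract, and `summarize(text, 0.25)` to the 25% level. With real WordNet glosses in place of the toy table, the weights would of course differ.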