DocumentCode
120890
Title
An approach to automatic text summarization using WordNet
Author
Pal, Alok Ranjan ; Saha, D.
Author_Institution
Dept. of Comput. Sci. & Eng., Coll. of Eng. & Manage., Kolaghat, India
fYear
2014
fDate
21-22 Feb. 2014
Firstpage
1169
Lastpage
1173
Abstract
Text Summarization is the procedure by which the significant portions of a text are retrieved. Most of the approaches perform the summarization based on some hand tagged rules, such as format of the writing of a sentence, position of a sentence in the text, frequency of few particular words in a sentence etc. But according to different input sources, these pre-defined constraints greatly affect the result. The proposed approach performs the summarization task by unsupervised learning methodology. The importance of a sentence in an input text is evaluated by the help of Simplified Lesk algorithm. As an online semantic dictionary WordNet is used. First, this approach evaluates the weights of all the sentences of a text separately using the Simplified Lesk algorithm and arranges them in decreasing order according to their weights. Next, according to the given percentage of summarization, a particular number of sentences are selected from that ordered list. The proposed approach gives best results upto 50% summarization of the original text and gives satisfactory result even upto 25% summarization of the original text.
Keywords
database management systems; information retrieval; text analysis; unsupervised learning; automatic text summarization; hand tagged rules; online semantic dictionary WordNet; simplified lesk algorithm; text retrieval; unsupervised learning methodology; Abstracts; Conferences; Data mining; Dictionaries; Presses; Semantics; Abstract; Automatic Text Summarization; Extract; Lesk algorithm; WordNet;
fLanguage
English
Publisher
ieee
Conference_Titel
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location
Gurgaon
Print_ISBN
978-1-4799-2571-1
Type
conf
DOI
10.1109/IAdCC.2014.6779492
Filename
6779492
Link To Document