Title :
Test model for summarizing Hindi text using extraction method
Author :
Thaokar, Chetana ; Malik, Latesh
Author_Institution :
Dept. of Comput. Sci. & Eng., GH Raisoni Coll. of Eng., Nagpur, India
Abstract :
The amount of information available on the web is doubling day by day, which is leading to information overload, and finding important and useful information is becoming difficult. Automatic summary generation techniques address this issue by generating shortened information from documents written on the same topic. Such systems are an interesting and attractive research area. They make it possible to find the main points of a text, so the user can spend less time reading the whole document. This paper discusses an approach to summarizing Hindi text documents using the sentence extraction method. It uses Hindi Wordnet to tag the appropriate part of speech (POS) of each word in order to check the subject-object-verb (SOV) structure of a sentence. It also uses a genetic algorithm to optimize the generated summary based on text feature terms, so that the summary covers the maximum theme with less redundancy.
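The genetic-algorithm step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their feature set or fitness function, so everything below is an assumption made for illustration. The names `fitness` and `summarize`, the whitespace tokenization, the term-frequency weights, the Jaccard-based redundancy penalty, and the GA parameters are all hypothetical. The Hindi Wordnet POS tagging and SOV check are not modeled here.

```python
import random
from collections import Counter
from itertools import combinations


def fitness(selection, sentences, weights):
    """Score a tuple of sentence indices: theme coverage minus redundancy.

    Assumed (hypothetical) definition: coverage is the summed weight of the
    distinct terms covered; redundancy is the mean pairwise Jaccard overlap.
    """
    covered = set().union(*(sentences[i] for i in selection))
    coverage = sum(weights[t] for t in covered)
    overlaps = [
        len(sentences[a] & sentences[b]) / len(sentences[a] | sentences[b])
        for a, b in combinations(selection, 2)
        if sentences[a] | sentences[b]
    ]
    redundancy = sum(overlaps) / len(overlaps) if overlaps else 0.0
    return coverage - redundancy


def summarize(text_sentences, k, generations=50, pop_size=20, seed=0):
    """Pick k sentences with a small elitist genetic algorithm (sketch)."""
    n = len(text_sentences)
    if not 0 < k <= n:
        raise ValueError("k must be between 1 and the number of sentences")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible

    # Assumed text feature: normalized term frequency over the document.
    sentences = [set(s.split()) for s in text_sentences]
    counts = Counter(t for s in text_sentences for t in s.split())
    total = sum(counts.values())
    weights = {t: c / total for t, c in counts.items()}

    def score(selection):
        return fitness(selection, sentences, weights)

    # Each individual is a sorted tuple of k distinct sentence indices.
    population = [tuple(sorted(rng.sample(range(n), k))) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=score, reverse=True)
        parents = population[: pop_size // 2]  # keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Crossover: draw k indices from the union of both parents.
            child = set(rng.sample(sorted(set(a) | set(b)), k))
            if rng.random() < 0.2:
                # Mutation: swap one selected sentence for an unselected one.
                child.discard(rng.choice(sorted(child)))
                child.add(rng.choice([i for i in range(n) if i not in child]))
            children.append(tuple(sorted(child)))
        population = parents + children

    best = max(population, key=score)
    return [text_sentences[i] for i in best]  # original document order
```

Calling `summarize(sentences, k)` on a list of sentence strings returns `k` of them in document order. A real system would replace the term-frequency weights with whatever text features the paper defines, and would need a Hindi-aware tokenizer rather than a plain whitespace split.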
Keywords :
feature extraction; genetic algorithms; text analysis; Hindi Wordnet; Hindi text documents; Hindi text summarization; automatic summary generation technique; genetic algorithm; information overload; sentence extraction method; shortened information generation; text feature terms; Biological cells; Conferences; Feature extraction; Genetic algorithms; Pragmatics; Sociology; Statistics;
Conference_Title :
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location :
JeJu Island
Print_ISBN :
978-1-4673-5759-3
DOI :
10.1109/CICT.2013.6558271