DocumentCode :
2194547
Title :
Automatic Summarization for Chinese Text Based on Combined Words Recognition and Paragraph Clustering
Author :
Jiang Chang-jin ; Peng Hong ; Ma Qian-li ; Chen Jian-chao
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2010
fDate :
2-4 April 2010
Firstpage :
591
Lastpage :
594
Abstract :
With the tremendous amount of information available electronically, there is an increasing requirement for automatic text summarization systems. An extractive summarization method is represented. The weight of a Chinese word/phrase is computed based on its frequency, part of speech, position and length. The weight of a Chinese sentence is computed by its content, position, length and cue words in it. The adjacent paragraphs are clustered into same cluster or different clusters according to their similarity. The experiment results show that the proposed algorithm has a significantly better performance compared with the traditional automated summarization algorithms based on TF-ISF method.
Keywords :
pattern clustering; text analysis; Chinese phrase computation; Chinese sentence computation; Chinese word computation; TF-ISF method; automatic Chinese text summarization systems; combined words recognition; extractive summarization method; paragraph clustering; Artificial intelligence; Clustering algorithms; Data mining; Dictionaries; Frequency; Information retrieval; Information technology; Natural languages; Speech; Text recognition; Chinese combined word; automatic summarization; paragraph clustering; weight computation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on
Conference_Location :
Jinggangshan
Print_ISBN :
978-1-4244-6730-3
Electronic_ISBN :
978-1-4244-6743-3
Type :
conf
DOI :
10.1109/IITSI.2010.15
Filename :
5453695
Link To Document :
بازگشت