DocumentCode :
2990191
Title :
An Efficient Linear Text Segmentation Algorithm Using Hierarchical Agglomerative Clustering
Author :
Wu, Ji-Wei ; Tseng, Judy C R ; Tsai, Wen-Nung
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2011
fDate :
3-4 Dec. 2011
Firstpage :
1081
Lastpage :
1085
Abstract :
Linear text segmentation aims at dividing a long text into several topical segments. It is beneficial to many natural language processing tasks, such as information retrieval and document summarization. In this article, an efficient linear text segmentation algorithm based on hierarchical agglomerative clustering is presented. The proposed linear text segmentation algorithm is implemented without auxiliary knowledge base, parameter setting, and user involvement. Experimental results show that the proposed linear text segmentation algorithm not only provides linear time computational complexity, but also provides comparable segmentation accuracy with several well-known linear text segmentation algorithms.
Keywords :
computational complexity; knowledge based systems; natural language processing; pattern clustering; text analysis; auxiliary knowledge base; efficient linear text segmentation algorithm; hierarchical agglomerative clustering; linear time computational complexity; natural language processing task; segmentation accuracy; Accuracy; Algorithm design and analysis; Clustering algorithms; Computational complexity; Error probability; Heuristic algorithms; Merging; NLP application; computational intelligence; hierarchical agglomerative clustering; text segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security (CIS), 2011 Seventh International Conference on
Conference_Location :
Hainan
Print_ISBN :
978-1-4577-2008-6
Type :
conf
DOI :
10.1109/CIS.2011.240
Filename :
6128290
Link To Document :
بازگشت