Title :
Study on semantic paragraph partition in automatic abstracting system
Author :
Min, Wan ; Zhensheng, Luo ; Yuqing, Guo
Author_Institution :
Lab. of Computational Linguistics, Tsinghua Univ., Beijing, China
Abstract :
Semantic paragraph partition is an important problem in text structure analysis for an automatic abstracting system. For an article containing distinct headings, the paper presents heading models for Chinese text and divides the article into semantic paragraphs based on the recognition of headings. For an article without headings, the paper establishes a vector space model of the whole article based on its paragraphs, and then clusters semantically related paragraphs into semantic paragraphs.
Keywords :
abstracting; computational linguistics; natural languages; pattern clustering; text analysis; Chinese text; automatic abstracting system; heading recognition; semantic paragraph partition; semantically relative paragraphs; text structure analysis; vector space model; Character recognition; Computational linguistics; Data mining; Functional analysis; Space technology; Text recognition; Wide area networks
Conference_Title :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-7087-2
DOI :
10.1109/ICSMC.2001.973030
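
Illustrative sketch :
The abstract's second approach (a paragraph-based vector space model followed by clustering) might look roughly like the Python sketch below. The record does not give the paper's term weighting, similarity measure, or clustering procedure, so everything here is an assumption: raw term-frequency vectors, cosine similarity, and a simple rule that merges a paragraph with the preceding one when their similarity reaches a hypothetical `threshold`. The regex tokenizer is also a stand-in; Chinese text would need a word segmenter instead.

```python
import math
import re
from collections import Counter


def paragraph_vector(paragraph):
    """Build a bag-of-words term-frequency vector for one paragraph."""
    # Assumption: regex tokenization on word characters; a real system for
    # Chinese text would use a word segmenter here.
    return Counter(re.findall(r"\w+", paragraph.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(
        sum(c * c for c in v.values())
    )
    return dot / norm if norm else 0.0


def partition(paragraphs, threshold=0.2):
    """Group consecutive paragraphs into semantic paragraphs.

    Returns a list of groups, each a list of paragraph indices.
    Assumption: only adjacent paragraphs are merged, and the threshold
    value is illustrative, not taken from the paper.
    """
    if not paragraphs:
        return []
    vectors = [paragraph_vector(p) for p in paragraphs]
    groups = [[0]]
    for i in range(1, len(paragraphs)):
        if cosine(vectors[i - 1], vectors[i]) >= threshold:
            groups[-1].append(i)  # similar enough: same semantic paragraph
        else:
            groups.append([i])  # topic shift: start a new semantic paragraph
    return groups


paragraphs = [
    "Heading recognition uses heading models.",
    "Heading models describe numbering patterns of headings.",
    "Clustering groups paragraph vectors by cosine similarity.",
    "Cosine similarity compares paragraph vectors.",
]
print(partition(paragraphs))  # [[0, 1], [2, 3]]
```

Merging only adjacent paragraphs keeps each semantic paragraph contiguous in the text, which suits abstracting; a general clustering algorithm could group non-adjacent paragraphs and would need an extra step to restore contiguity.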