DocumentCode :
3317348
Title :
Automatic subject indexing of Chinese documents
Author :
Zhang, Sulan ; He, Qing ; Zheng, Zheng ; Shi, Zhongzhi
Author_Institution :
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
256
Lastpage :
261
Abstract :
Automatic subject indexing is a process to produce automatically a set of attributes that represent the content or topic of a document. In this paper, two approaches of automatic subject indexing based on VSM (vector space model) and subject words segmentation respectively are presented. The experimental results show that the first approach based on VSM is appropriate when the documents, which are indexed, are concentrative and the subject words available are less. The second approach based on subject words segmentation improves greatly efficiency of indexing and inter-indexer consistency.
Keywords :
indexing; word processing; automatic subject indexing; document indexing; inter-indexer consistency; subject word segmentation; vector space model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598744
Filename :
1598744
Link To Document :
بازگشت