DocumentCode :
1542780
Title :
Document ranking and the vector-space model
Author :
Lee, Dik L. ; Chuang, Huei ; Seamons, Kent
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., Hong Kong
Volume :
14
Issue :
2
fYear :
1997
Firstpage :
67
Lastpage :
75
Abstract :
Efficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a daunting task because it is difficult to extract the semantics of natural language texts. Many problems must be resolved before natural language processing techniques can be effectively applied to a large collection of texts. Most existing text retrieval techniques rely on indexing keywords. Unfortunately, keywords or index terms alone cannot adequately capture the document contents, resulting in poor retrieval performance. Yet keyword indexing is widely used in commercial systems because it is still the most viable way by far to process large amounts of text. Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings
Keywords :
document handling; indexing; information retrieval; natural language interfaces; statistical analysis; vocabulary; commercial systems; document rankings; index terms; indexing; keywords; natural language text; retrieval performance; statistical analysis; text retrieval techniques; textual information; vector-space model; Content based retrieval; Data mining; Database languages; Feedback; Indexing; Information retrieval; Information systems; Particle measurements; Prototypes; Technology management;
fLanguage :
English
Journal_Title :
Software, IEEE
Publisher :
ieee
ISSN :
0740-7459
Type :
jour
DOI :
10.1109/52.582976
Filename :
582976
Link To Document :
بازگشت