مرکز منطقه ای اطلاع رساني علوم و فناوري - Document ranking and the vector-space model

DocumentCode :

1542780

Title :

Document ranking and the vector-space model

Author :

Lee, Dik L. ; Chuang, Huei ; Seamons, Kent

Author_Institution :

Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., Hong Kong

Volume :

Issue :

fYear :

1997

Firstpage :

Lastpage :

Abstract :

Efficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a daunting task because it is difficult to extract the semantics of natural language texts. Many problems must be resolved before natural language processing techniques can be effectively applied to a large collection of texts. Most existing text retrieval techniques rely on indexing keywords. Unfortunately, keywords or index terms alone cannot adequately capture the document contents, resulting in poor retrieval performance. Yet keyword indexing is widely used in commercial systems because it is still the most viable way by far to process large amounts of text. Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings

Keywords :

document handling; indexing; information retrieval; natural language interfaces; statistical analysis; vocabulary; commercial systems; document rankings; index terms; indexing; keywords; natural language text; retrieval performance; statistical analysis; text retrieval techniques; textual information; vector-space model; Content based retrieval; Data mining; Database languages; Feedback; Indexing; Information retrieval; Information systems; Particle measurements; Prototypes; Technology management;

fLanguage :

English

Journal_Title :

Software, IEEE

Publisher :

ieee

ISSN :

0740-7459

Type :

jour

DOI :

10.1109/52.582976

Filename :

582976

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1542780