مرکز منطقه ای اطلاع رساني علوم و فناوري - A Universal Full Text Index with Access Control and Annotation Driven Information Retrieval

DocumentCode :

2832726

Title :

A Universal Full Text Index with Access Control and Annotation Driven Information Retrieval

Author :

Chávez, Edgar ; Téllez, Eric Sadit

Author_Institution :

Univ. Michoacana

fYear :

2006

fDate :

Nov. 2006

Firstpage :

135

Lastpage :

140

Abstract :

Full text databases are tightly linked to the application layer. Currently IR projects must be integrated in the back-end using, at best, a general-purpose language-independent API. This architecture limits and precludes the rapid prototyping. In this paper we present a new approach, a very simple architecture, towards the development of a general purpose full-text database. We implemented a standard inverted file index, providing various extra capabilities. For each document stored we simply added a set of qualifiers, MD5 hashes and keywords, algorithmic ally unrelated to the document content. This allows to hierarchically control access to the document, iteratively improve document categorization, add and delete annotations, and document versions. All transactions are done through a standard Web service interface. This feature facilitates system integration, and testing. We describe a set of applications where our concept can be useful. The universe of applications for our concept encompass those areas where document annotations are relevant. Once stored and annotated (with qualifiers), the documents can be retrieved by a combination of qualifiers and document content. Additionally, we show our prototype in action, explaining how can be extended to support retrieval and storage models appeared in some popular sites recently

Keywords :

full-text databases; indexing; information retrieval; MD5 hashes; annotation driven information retrieval; document access control; document categorization; full-text database; standard inverted file index; universal full text index; Access control; Computer architecture; Content based retrieval; Databases; Information retrieval; Prototypes; Software prototyping; System testing; Web search; Web services;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computing, 2006. CIC '06. 15th International Conference on

Conference_Location :

Mexico City

Print_ISBN :

0-7695-2708-6

Type :

conf

DOI :

10.1109/CIC.2006.17

Filename :

4023800

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2832726