Title :
Towards a distributed federated architecture for digital documents
Author_Institution :
Amrita Vishwa Vidyapeetham, Coimbatore, India
Abstract :
Creating a federated architecture is one of the most significant issues in the field of digital libraries. Human perception of relevance is not uniform, which makes the retrieval process difficult to automate. In this work, we design a system that integrates existing digital library architectures. The architecture uses integrated components such as metadata, standard descriptors, and feature extraction for text searching and retrieval. Databases of different sizes were used to estimate the accuracy of the system. The proposed algorithm is based on the concept of a minimum weight tree and removes irrelevant texts from the retrieved hits according to a dynamic threshold supplied to the algorithm. We found that a careful combination of the different features, guided by our proposed heuristic, can aid the creation of a unified architecture for digital libraries.
Keywords :
digital libraries; distributed processing; information retrieval; text analysis; trees (mathematics); digital documents; digital library; distributed federated architecture; dynamic threshold; feature extraction; integrated systems; irrelevant texts; metadata; minimum weight tree; standard descriptors; text retrieval; text searching; Data engineering; Standards; Databases; Digital library; Federated Architecture; Text mining;
Conference_Title :
Digital Information Management (ICDIM), 2012 Seventh International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2428-1
DOI :
10.1109/ICDIM.2012.6360144