Title :
An entity based RDF indexing schema using Hadoop and HBase
Author :
Abiri, Fateme ; Kahani, Mohsen ; Zarinkalam, Fatane
Author_Institution :
Dept. of Comput. Eng., Ferdowsi Univ. Mashhad, Mashhad, Iran
Abstract :
Recent development of semantic web has opened new research to design search engines which organize and manage semantic data. The core of a search engine is the indexing system which consists of two main parts: data storage and data retrieval. With the increasing amount of semantic data, the most important goal expected from an indexing system is the ability to store large amount of data and retrieve them as fast as possible. In other words, having a scalable indexing system is one of the major challenges in semantic search engines. In this paper, a scalable method is presented to index the RDF data which utilizes HBase database, a NOSQL database management system, as its underlying data storage. HBase provides random access to massive data on the distributed framework of Hadoop, therefore, it can be a proper option for the management of the massive data. Further, due to the importance and popularity of the entity-based queries, a new schema based on a clustering algorithm is designed to effectively respond to this type of queries. The experimental evaluation shows that the proposed indexing system is effective in terms of improving scalability and retrieval of RDF data.
Keywords :
data handling; database management systems; indexing; parallel processing; pattern clustering; query processing; search engines; semantic Web; HBase; HBase database; Hadoop; NOSQL database management system; RDF data retrieval; RDF data scalability; data retrieval; data storage; entity based RDF indexing schema; entity-based queries; random access; search engines; semantic Web; semantic data management; semantic search engines; Distributed databases; Indexing; Query processing; Resource description framework; Scalability; Agglomerative Clustering Algorithm; Entity Based Queries; HBase; Hadoop; NOSQL Database; RDF Indexing;
Conference_Titel :
Computer and Knowledge Engineering (ICCKE), 2014 4th International eConference on
Conference_Location :
Mashhad
Print_ISBN :
978-1-4799-5486-5
DOI :
10.1109/ICCKE.2014.6993400