Title :
A Distributed Data Allocation Algorithm for Biological Databases
Author :
Tonini, Gustavo ; Siqueira, Frank
Author_Institution :
Dept. of Inf. & Stat., Fed. Univ. of Santa Catarina, Florianopolis, Brazil
Abstract :
Storage and processing of large data sets on distributed platforms allows parallel query execution and is capable of improving scalability. However, defining a distributed allocation schema is a complex task that has been based mostly on ad-hoc, trial-and-error strategies. This paper describes an algorithm for creating a distributed allocation schema aimed at improving query performance. The algorithm is based on data patterns and query history analysis and can be applied to any existing centralized database. The proposed algorithm was evaluated using a large biological database as case study, achieving very promising results.
Keywords :
biology computing; data handling; distributed algorithms; biological databases; centralized database; data patterns; distributed allocation schema; distributed data allocation algorithm; distributed platforms; large data sets; parallel query execution; query history analysis; query performance; trial-and-error strategies; Algorithm design and analysis; Biological cells; Clustering algorithms; Distributed databases; Resource management; biological data; data allocation; datawarehouse; distributed databases; parallel processing;
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/CSE.2013.85