Title :
Parallel Collection of Live Data Using Hadoop
Author :
Talattinis, Kyriacos ; Sidiropoulou, Aikaterini ; Chalkias, Konstantinos ; Stephanides, George
Author_Institution :
Dept. of Appl. Inf., Univ. of Macedonia, Thessaloniki, Greece
Abstract :
Hadoop is a fault tolerant Java framework that supports data distribution and process parallelization using commodity hardware. Based on the provided scalability and the independence of task execution, we combined Hadoop with crawling techniques to implement various applications that deal with large amount of data. Our experiments show that Hadoop is a very useful and trustworthy tool for creating distributed programs that perform better in terms of computational efficiency.
Keywords :
Java; cryptography; fault tolerance; parallel processing; Hadoop; commodity hardware; fault tolerant Java framework; live data; parallel collection; Cryptography; Databases; File systems; Force; Games; Hardware; Java; Hadoop; commodity hardware; distributed systems; live data collection; parallelization;
Conference_Titel :
Informatics (PCI), 2010 14th Panhellenic Conference on
Conference_Location :
Tripoli
Print_ISBN :
978-1-4244-7838-5
DOI :
10.1109/PCI.2010.47