Title :
MRData: A MapReduce-Based Tool for Heterogeneous Data Integration
Author :
Xu, Liutong ; Jin, Kai ; Tian, Hongqiao
Author_Institution :
Beijing Key Lab. of Intell. Telecommun. Software & Multimedia, Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
As data volumes grow sharply and the relationships among data sources become increasingly intricate, integrating massive data sources and discovering latent information in the integrated data have become urgent problems. At present, industry tends to adopt distributed computing models to integrate massive data. Visualization is a critical step in data analysis and data mining for obtaining valuable, in-depth information. We design a tool called MRData for heterogeneous data integration, which has two features: 1) parallel data processing based on Hadoop, a distributed platform; 2) visual analysis. Finally, experiments verify the efficiency of MRData.
Keywords :
data analysis; data integrity; data mining; MRData; MapReduce-based tool; data sources; distributed computing; heterogeneous data integration; parallel data processing; Business; Data mining; Data processing; Data visualization; Distributed databases; Semantics; Visualization; data integration; hadoop; mapreduce; visualization
Conference_Title :
Information Science and Management Engineering (ISME), 2010 International Conference of
Conference_Location :
Xi'an
Print_ISBN :
978-1-4244-7669-5
Electronic_ISBN :
978-1-4244-7670-1
DOI :
10.1109/ISME.2010.252