Title :
Management of Unstructured Geological Data Based on Hadoop
Author :
Dongqi Wei;Yueqin Zhu
Author_Institution :
Xian Center of Geol. Survey, Xian, China
Abstract :
For over a century Geological Survey of China (CGS) has accumulated an enormous amount of data on geology, mineral resources and petroleum fields of China. Many of this geological information were recorded in unstructured formats. How to derive useful knowledge from these data for industrial and research users is a challenging problem for a long period. In this paper, we developed a hierarchical model to store the unstructured geological data based on Hadoop. Using a modified KNN (K-Nearest Neighbor) algorithm based on MapReduce framework, top 10 geological words were identified and analyzed in the unstructured geological test data.
Keywords :
"Geology","Data models","Classification algorithms","Object oriented modeling","Entropy","Databases","Text categorization"
Conference_Titel :
Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom), 2015 IEEE 12th Intl Conf on
DOI :
10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.93