DocumentCode
3781707
Title
Management of Unstructured Geological Data Based on Hadoop
Author
Dongqi Wei;Yueqin Zhu
Author_Institution
Xian Center of Geol. Survey, Xian, China
fYear
2015
Firstpage
432
Lastpage
435
Abstract
For over a century, the Geological Survey of China (CGS) has accumulated an enormous amount of data on the geology, mineral resources and petroleum fields of China. Much of this geological information was recorded in unstructured formats. Deriving useful knowledge from these data for industrial and research users has long been a challenging problem. In this paper, we developed a hierarchical model to store unstructured geological data based on Hadoop. Using a modified KNN (K-Nearest Neighbor) algorithm based on the MapReduce framework, the top 10 geological words were identified and analyzed in the unstructured geological test data.
Keywords
"Geology","Data models","Classification algorithms","Object oriented modeling","Entropy","Databases","Text categorization"
Publisher
ieee
Conference_Titel
2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom)
Type
conf
DOI
10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.93
Filename
7518272