• DocumentCode
    3781707
  • Title

    Management of Unstructured Geological Data Based on Hadoop

  • Author

    Dongqi Wei;Yueqin Zhu

  • Author_Institution
    Xi'an Center of Geological Survey, Xi'an, China
  • fYear
    2015
  • Firstpage
    432
  • Lastpage
    435
  • Abstract
    For over a century, the Geological Survey of China (CGS) has accumulated an enormous amount of data on the geology, mineral resources, and petroleum fields of China. Much of this geological information was recorded in unstructured formats. Deriving useful knowledge from these data for industrial and research users has long been a challenging problem. In this paper, we develop a hierarchical model based on Hadoop to store the unstructured geological data. Using a modified KNN (K-Nearest Neighbor) algorithm built on the MapReduce framework, the top 10 geological words in the unstructured geological test data were identified and analyzed.
  • Keywords
    "Geology","Data models","Classification algorithms","Object oriented modeling","Entropy","Databases","Text categorization"
  • Publisher
    IEEE
  • Conference_Titel
    2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom)
  • Type

    conf

  • DOI
    10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.93
  • Filename
    7518272