DocumentCode :
2142188
Title :
A VSM-based data mining engine for geoscience documents
Author :
Lv, Peng ; Bi, Zhiwei ; Zhu, Pengfei ; Wu, Wen
Author_Institution :
Institute of remote sensing applications, China Academy of Sciences, Beijing, China
fYear :
2010
fDate :
4-6 Dec. 2010
Firstpage :
4909
Lastpage :
4912
Abstract :
With the development of information technology in geosciences, enormous data and documents can not be processed by ordinary methods. Furthermore it is difficult to precisely search the target document quickly. In this paper, we propose the use of vector space model (VSM) for automatic date mining of geosciences documents, and a VSM-based search engine system is designed and implemented, which includes three main components: 1)a word segment structure with two hash tables managing the first and the last words of a geo-item and a Trie tree containing the rest of words; 2) a linear space composited by all related documents which need the calculating of similarity; 3) a vector space module mapping documents to multi-dimensional vector space and comparing keywords with features of documents to decide the similarity. This system can make it convenient in geodata sharing and improves the work process efficiently.
Keywords :
Adaptation model; Computational modeling; Data mining; Engines; Geology; Information retrieval; Vectors; VSM; date mining; geoscience documents; trie tree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
Type :
conf
DOI :
10.1109/ICISE.2010.5690957
Filename :
5690957
Link To Document :
بازگشت