Title :
Evaluation on geospatial information extraction and retrieval: Mining thematic maps from web source
Author :
Dewandaru, Agung ; Supriana S, Iping ; Akbar, Saiful
Author_Institution :
Sch. of Electr. Eng. & Inf., Bandung Inst. of Technol., Bandung, Indonesia
Abstract :
The World Wide Web easily becomes the largest repository of natural language text data. We are particularly interested in state-of-the-art methods in exploiting geospatial information the web. The survey is done in the context of its extraction methods, retrieval, visualization, and further possible mining or knowledge discovery scenarios in order to produce thematic maps automatically from the web corpus. We found that Web-based Geographic Information Retrieval (GIR) methods that returns selected relevant area instead of points is still lacking, even though area modeling is common in GIS. We also found that most GIR methods is still focused on places and buildings instead of theme or information around some area. Thus it indicates that the state of the art GIR methods are not yet sufficient for thematic extraction and retrieval to generate thematic maps from web corpus. Bayesian topic models such as Latent Dirichlet Allocation may serve as a good basis to serve such use cases.
Keywords :
Internet; cartography; data mining; geographic information systems; information retrieval; Bayesian topic models; GIR method; Web source; geospatial information exploitation; geospatial information extraction; geospatial information retrieval; knowledge discovery; latent Dirichlet allocation; natural language text data; thematic maps mining; Context; Data mining; Geospatial analysis; Information retrieval; Measurement; Natural languages; Prototypes; geographic information retrieval; information extraction; information retrieval; information visualization; knowledge discovery; thematic extraction; thematic maps; topic modeling; web mining;
Conference_Titel :
Information and Communication Technology (ICoICT ), 2015 3rd International Conference on
Conference_Location :
Nusa Dua
DOI :
10.1109/ICoICT.2015.7231437