DocumentCode :
1993674
Title :
Assigning geographical focus to documents
Author :
Chen, Min ; Lin, Xing ; Zhang, Yi ; Wang, Xingguang ; Yu, Hao
Author_Institution :
Inst. of Remote Sensing & Geogr. Inf. Syst., Peking Univ., Beijing, China
fYear :
2010
fDate :
18-20 June 2010
Firstpage :
1
Lastpage :
6
Abstract :
Geographical information becomes a kind of very important attribute for web documents, considering the fact that a large proportion of documents on the web contain geographical information. GIR (Geographical information retrieval) systems can identify those geographical information and extract the geographical focus in the documents automatically, hence supporting geo-related queries for information retrieval. Therefore, GIR has become a hot topic in both GIS and IR (Information Retrieval) areas recently. To take full advantage of geographical information within web documents in support of geo-related IR queries by returning more accurate results to users, a GIR system needs to get the geographical focus of the document, upon which a spatial index could then be established for a more accurate and efficient processing of spatial IR queries. So among all those steps within a GIR system, how to get the geographical focus for each document remains an essential one. In response to this demand, authors of this paper present a novel and promising algorithm. Before our explanation of proposed algorithm, we first briefly introduce SASEIC (Spatial-Aware Search Engine in Chinese)-a GIR prototype System we have implemented for the convenience of our research in GIR field. Then we start our description of proposed algorithm with the analysis of various possible PNPs (Place Name Patterns) within documents. After that, we present the algorithm with detailed principle and steps, which is conceived based on hierarchical structure of placenames within the documents for retrieval. Finally, at the end of this paper, we show the results of evaluation work for the proposed algorithm and draw our conclusions for this paper, as well as important directions of our future research.
Keywords :
Internet; document handling; geographic information systems; information retrieval; GIS; SASEIC; Spatial-Aware Search Engine in Chinese; Web documents; geographical focus; geographical information retrieval; place name patterns; spatial index; Algorithm design and analysis; Cities and towns; Entropy; Pediatrics; Prototypes; Web pages; Geographical focus; Geographical information retrieval; Hierarchical relationship; Place Name Pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Geoinformatics, 2010 18th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7301-4
Type :
conf
DOI :
10.1109/GEOINFORMATICS.2010.5567598
Filename :
5567598
Link To Document :
بازگشت