DocumentCode
1279142
Title
Finding aggregate proximity relationships and commonalities in spatial data mining
Author
Knorr, Edwin M. ; Ng, Raymond T.
Author_Institution
Dept. of Comput. Sci., British Columbia Univ., Vancouver, BC, Canada
Volume
8
Issue
6
fYear
1996
fDate
12/1/1996 12:00:00 AM
Firstpage
884
Lastpage
897
Abstract
Studies two spatial knowledge discovery problems involving proximity relationships between clusters and features. The first problem is: given a cluster of points, how can we efficiently find features (represented as polygons) that are closest to the majority of points in the cluster? We measure proximity in an aggregate sense due to the nonuniform distribution of points in a cluster (e.g. houses on a map), and the different shapes and sizes of features (e.g. natural or man-made geographic features). The second problem is: given n clusters of points, how can we extract the aggregate proximity commonalities (i.e. features) that apply to most, if not all, of the n clusters? Regarding the first problem, the main contribution of the paper is the development of Algorithm CRH (Circle, Rectangle and Hull), which uses geometric approximations (i.e. encompassing circles, isothetic rectangles and convex hulls) to filter and select features. The highly scalable and incremental Algorithm CRH can examine over 50,000 features and their spatial relationships with a given cluster in approximately one second of CPU time. Regarding the second problem, the key contribution is the development of Algorithm GenCom (Generalization for Commonality extraction) that makes use of concept generalization to effectively derive many meaningful commonalities that cannot be found otherwise
Keywords
computational geometry; deductive databases; feature extraction; geographic information systems; knowledge acquisition; pattern recognition; spatial reasoning; visual databases; Algorithm CRH; Algorithm GenCom; CPU time; aggregate proximity relationships; commonality extraction; concept generalization; convex hulls; encompassing circles; feature extraction; feature shapes; feature sizes; geographic information systems; geometric approximations; geometric filtering; isothetic rectangles; nonuniform distribution; point clusters; polygons; spatial data mining; spatial knowledge discovery problems; Aggregates; Application software; Cities and towns; Data analysis; Data mining; Humans; Relational databases; Satellites; Shape; Spatial databases;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.553156
Filename
553156
Link To Document