• DocumentCode
    1279142
  • Title

    Finding aggregate proximity relationships and commonalities in spatial data mining

  • Author

    Knorr, Edwin M. ; Ng, Raymond T.

  • Author_Institution
    Dept. of Comput. Sci., British Columbia Univ., Vancouver, BC, Canada
  • Volume
    8
  • Issue
    6
  • fYear
    1996
  • fDate
    12/1/1996 12:00:00 AM
  • Firstpage
    884
  • Lastpage
    897
  • Abstract
    Studies two spatial knowledge discovery problems involving proximity relationships between clusters and features. The first problem is: given a cluster of points, how can we efficiently find features (represented as polygons) that are closest to the majority of points in the cluster? We measure proximity in an aggregate sense due to the nonuniform distribution of points in a cluster (e.g. houses on a map), and the different shapes and sizes of features (e.g. natural or man-made geographic features). The second problem is: given n clusters of points, how can we extract the aggregate proximity commonalities (i.e. features) that apply to most, if not all, of the n clusters? Regarding the first problem, the main contribution of the paper is the development of Algorithm CRH (Circle, Rectangle and Hull), which uses geometric approximations (i.e. encompassing circles, isothetic rectangles and convex hulls) to filter and select features. The highly scalable and incremental Algorithm CRH can examine over 50,000 features and their spatial relationships with a given cluster in approximately one second of CPU time. Regarding the second problem, the key contribution is the development of Algorithm GenCom (Generalization for Commonality extraction) that makes use of concept generalization to effectively derive many meaningful commonalities that cannot be found otherwise
  • Keywords
    computational geometry; deductive databases; feature extraction; geographic information systems; knowledge acquisition; pattern recognition; spatial reasoning; visual databases; Algorithm CRH; Algorithm GenCom; CPU time; aggregate proximity relationships; commonality extraction; concept generalization; convex hulls; encompassing circles; feature extraction; feature shapes; feature sizes; geographic information systems; geometric approximations; geometric filtering; isothetic rectangles; nonuniform distribution; point clusters; polygons; spatial data mining; spatial knowledge discovery problems; Aggregates; Application software; Cities and towns; Data analysis; Data mining; Humans; Relational databases; Satellites; Shape; Spatial databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.553156
  • Filename
    553156