• DocumentCode
    169995
  • Title

    NACluster: A Non-supervised Clustering Algorithm for Matching Multi Catalogues

  • Author

    Pires de Moura Freire, Vinicius ; Machado Porto, Fabio Andre ; Fernandes de Macedo, Jose Antonio ; Akbarinia, Reza

  • Author_Institution
    Fed. Univ. of Ceara Fortaleza, Fortaleza, Brazil
  • Volume
    2
  • fYear
    2014
  • fDate
    20-24 Oct. 2014
  • Firstpage
    83
  • Lastpage
    86
  • Abstract
    Astronomy surveys use powerful instruments to browse the sky and identify objects of interest within the surveyed region. Sky objects are individually characterized with spatial coordinates, identifying their position in the sky, in addition to other descriptive attributes. Composing an integrated view of the sky based on catalogues produced by different surveys faces a hard problem of matching objects that have been captured in various catalogues. Due to variations on capturing instruments calibration, the sky position of a single sky object may vary from a catalog to the other. Moreover, in particular dense regions of the sky this problem is exacerbated by a huge number of candidate matches for each given object. Traditional approaches for dealing with this problem use a threshold distance of "&b.epsi;" to reduce the number of matching candidates. Additionally, they adopt a pairwise approach for matching "n" catalogues inferring transitivity among matches, which not always hold. In this paper, we present NACluster a non-supervised clustering algorithm for dealing with sky object matching in multiple catalogues. NACluster matching strategy extends the traditional k-means clustering algorithm by relaxing the number k of cluster (i.e. matched sky objects). We experiment NACluster with real and synthetic catalogues and show that the results present better accuracy than state of the art solutions.
  • Keywords
    astronomy computing; cataloguing; pattern clustering; pattern matching; NACluster matching strategy; astronomy; descriptive attributes; instruments calibration; k-means clustering algorithm; matching candidate; multicatalogues; nonsupervised clustering algorithm; real catalogue; sky object matching; sky objects; sky position; spatial coordinates; surveyed region; synthetic catalogue; Accuracy; Astronomy; Catalogs; Clustering algorithms; Gaussian distribution; Object recognition; Search problems; astronomy; cross-matching; nacluster;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2014 IEEE 10th International Conference on
  • Conference_Location
    Sao Paulo
  • Print_ISBN
    978-1-4799-4288-6
  • Type

    conf

  • DOI
    10.1109/eScience.2014.61
  • Filename
    6972103