• DocumentCode
    688294
  • Title

    Measuring and Discovering Correlations in Large Data Sets

  • Author

    Lijue Liu ; Ming Li ; Sha Wen

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Central South Univ., Changsha, China
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1302
  • Lastpage
    1307
  • Abstract
    In this paper, a class of statistics named ART (the alternant recursive topology statistics) is proposed to measure the properties of correlation between two variables. A wide range of bi-variable correlations both linear and nonlinear can be evaluated by ART efficiently and equitably even if nothing is known about the specific types of those relationships. ART compensates the disadvantages of Reshef\´s model in which no polynomial time precise algorithm exists and the "local random" phenomenon can not be identified. As a class of nonparametric exploration statistics, ART is applied for analyzing a dataset of 10 American classical indexes, as a result, lots of bi-variable correlations are discovered.
  • Keywords
    data analysis; statistics; ART; American classical indexes; alternant recursive topology statistics; bi-variable correlations; correlation properties; dataset analysis; large data set correlation discovery; linear correlations; nonlinear correlations; nonparametric exploration statistics; Correlation; Histograms; Indexes; Microwave integrated circuits; Polynomials; Subspace constraints; Topology; ART Statistics; Association Mining; Correlation Mining; Non-Linear Correlation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.185
  • Filename
    6832067