• DocumentCode
    1757760
  • Title

    The Emerging "Big Dimensionality"

  • Author

    Yiteng Zhai ; Yew-Soon Ong ; Tsang, Ivor W.

  • Author_Institution
    Centre for Comput. Intell. (C2i), Nanyang Technol. Univ. (NTU), Singapore, Singapore
  • Volume
    9
  • Issue
    3
  • fYear
    2014
  • fDate
    Aug. 2014
  • Firstpage
    14
  • Lastpage
    26
  • Abstract
    The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated.
  • Keywords
    Big Data; artificial intelligence; feature selection; Big Data analytics; big dimensionality; big instance size factor; computational intelligence research communities; data analytics community; data repositories; data volume; feature dimensionality; feature selection schemes; Big data; Cellular phones; Computational intelligence; Data processing; Feature extraction; information processing;
  • fLanguage
    English
  • Journal_Title
    Computational Intelligence Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1556-603X
  • Type

    jour

  • DOI
    10.1109/MCI.2014.2326099
  • Filename
    6853478