• DocumentCode
    3570908
  • Title

    Real-time anomaly detection over VMware performance data using storm

  • Author

    Solaimani, Mohiuddin ; Khan, Latifur ; Thuraisingham, Bhavani

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA
  • fYear
    2014
  • Firstpage
    458
  • Lastpage
    465
  • Abstract
    Anomaly detection is the identification of items or observations which deviate from an expected pattern in a dataset. This paper proposes a novel real time anomaly detection framework for dynamic resource scheduling of a VMware-based cloud data center. The framework monitors VMware performance stream data (e.g. CPU load, memory usage, etc.). Hence, the framework continuously needs to collect data and make decision without any delay. We have used Apache Storm, distributed framework for handling performance stream data and making prediction without any delay. Storm is chosen over a traditional distributed framework (e.g., Hadoop and MapReduce, Mahout) that is good for batch processing. An incremental clustering algorithm to model benign characteristics is incorporated in our storm-based framework. During continuous incoming test stream, if the model finds data deviated from its benign behavior, it considers that as an anomaly. We have shown effectiveness of our framework by providing real-time complex analytic functionality over stream data.
  • Keywords
    cloud computing; computer centres; data handling; distributed processing; pattern clustering; scheduling; Apache Storm; VMware performance stream data handling; VMware-based cloud data center; batch processing; data collection; decision making; distributed framework; dynamic resource scheduling; incremental clustering algorithm; item identification; real-time anomaly detection framework; real-time complex analytic functionality; Clustering algorithms; Data models; Dynamic scheduling; Fasteners; Real-time systems; Storms; Training; Anomaly detection; Data center; Incremental clustering; Real-time anomaly detection; Resource scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/IRI.2014.7051925
  • Filename
    7051925