• DocumentCode
    1857680
  • Title

    Exposing Complex Bug-Triggering Conditions in Distributed Systems via Graph Mining

  • Author

    Seo, Eunsoo ; Khan, Mohammad Maifi Hasan ; Mohapatra, Prasant ; Han, Prasant Mohapatra Jiawei ; Abdelzaher, Tarek

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2011
  • fDate
    13-16 Sept. 2011
  • Firstpage
    186
  • Lastpage
    195
  • Abstract
    Software bugs in distributed systems are notoriously hard to find due to the large number of components involved and the non-determinism introduced by race conditions between messages. This paper introduces Pop Mine, a tool for diagnosing corner-case bugs by finding the minimal causal directed acyclic graph (DAG) of events, spanning multiple processes, which captures a bug-triggering condition. Being based on causal order, a global notion of time is not required in uncovering bug-triggering distributed event patterns. Bug triggering event DAGs can be identified by comparing execution graphs from successful runs to those where bug manifestations were observed, and exposing the minimal discriminative event DAGs that may be responsible for the problem. This is a significant extension to prior debugging tools, in that prior work considered much simpler bug-triggering conditions such as single events, event sets, or ordered chains of events. To the authors´ knowledge, this is the first paper that considers bug-triggering conditions in the form of distributed event graphs. To prove the effectiveness of our approach, we applied our tool to VCP, Chord and GreenGPS and diagnosed bugs. We also present performance analysis results to demonstrate the scalability of our approach.
  • Keywords
    data mining; directed graphs; distributed processing; security of data; Chord tool; GreenGPS tool; Pop Mine tool; VCP tool; bug-triggering distributed event pattern; complex bug-triggering condition; corner-case bugs diagnosis; distributed system; graph mining; minimal causal directed acyclic graph; software bug; Computer bugs; Computer science; Data mining; Delay; Entropy; Software; USA Councils; Data mining; Fault diagnosis; Software debugging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2011 International Conference on
  • Conference_Location
    Taipei City
  • ISSN
    0190-3918
  • Print_ISBN
    978-1-4577-1336-1
  • Electronic_ISBN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2011.62
  • Filename
    6047187