Title :
Precise, Scalable, and Online Request Tracing for Multitier Services of Black Boxes
Author :
Sang, Bo ; Zhan, Jianfeng ; Lu, Gang ; Wang, Haining ; Xu, Dongyan ; Wang, Lei ; Zhang, Zhihong ; Jia, Zhen
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
fDate :
6/1/2012 12:00:00 AM
Abstract :
As more and more multitier services are developed from commercial off-the-shelf components or heterogeneous middleware without source code available, both developers and administrators need a request tracing tool to (1) exactly know how a user request of interest travels through services of black boxes and (2) obtain macrolevel user request behaviors of services without manually analyzing massive logs. This need is further exacerbated by IT system “agility,” which mandates the tracing tool to provide online performance data since offline approaches cannot reflect system changes in real time. Moreover, considering the large scale of deployed services, a pragmatic tracing approach should be scalable in terms of the cost in collecting and analyzing logs. In this paper, we introduce a precise, scalable, and online request tracing tool for multitier services of black boxes. Our contributions are threefold. First, we propose a precise request tracing algorithm for multitier services of black boxes, which only uses application-independent knowledge. Second, we present a microlevel abstraction, component activity graph, to represent causal paths of each request. On the basis of this abstraction, we use dominated causal path patterns to represent repeatedly executed causal paths that account for significant fractions, and we further present a derived performance metric of causal path patterns, latency percentages of components, to enable debugging performance-in-the-large. Third, we develop two mechanisms, tracing on demand and sampling, to significantly increase the system scalability. We implement a prototype of the proposed system, called PreciseTracer, and release it as open source code. In comparison with WAP5-a black-box tracing approach, PreciseTracer achieves higher tracing accuracy and faster response time. Our experimental results also show that PreciseTracer has low overhead, and still achieves high tracing accuracy even if an aggressive sampling - olicy is adopted, indicating that PreciseTracer is a promising tracing tool for large-scale production systems.
Keywords :
graph theory; middleware; IT system agility; PreciseTracer; WAP5; application-independent knowledge; black-box tracing approach; commercial off-the-shelf components; component activity graph; component latency percentages; debugging performance; demand tracing mechanism; dominated causal path patterns; heterogeneous middleware; large-scale production systems; microlevel abstraction; multitier services; online request tracing tool; pragmatic tracing approach; sampling mechanism; service macrolevel user request behaviors; Context; Correlators; Debugging; Engines; Kernel; Message systems; Middleware; Multitier service; black boxes; micro- and macrolevel abstractions; online analysis; performance debugging; precise request tracing; scalability.;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2011.257