DocumentCode
2980933
Title
Leveraging Cloud Infrastructure for Troubleshooting Edge Computing Systems
Author
Fagan, Michael ; Khan, Mohammad Maifi Hasan ; Bing Wang
Author_Institution
Dept. of CSE, Univ. of Connecticut, Storrs, CT, USA
fYear
2012
fDate
17-19 Dec. 2012
Firstpage
440
Lastpage
447
Abstract
Modern cloud-based applications (e.g., Face book, Dropbox) serve a wide range of edge clients (e.g., laptops, smart phones). The clients´ characteristics vary significantly in terms of hardware (e.g., high end desktop vs. resource constrained smart phones), operating systems (e.g., Linux, Android, Mac OS, Windows), network connections (e.g., wireless vs. wired, 3G vs. 2G), and software versions (e.g., Firefox 12 vs. Firefox 13), just to name a few. Unfortunately, due to misconfiguration, outdated software, faulty hardware, or other reasons, many edge systems operate at suboptimal performance. Poor performance and root cause identification is extremely challenging for the client of the cloud system. To address this challenge, the troubleshooting service presented in this paper leverages such heterogeneity to identify and debug performance problems on edge devices. First, by looking at many runs across many different clients, the service groups clients in different clusters based on performance. Next, the service enables logging on remote clients to collect run time traces, and subsequently identifies the root cause by analyzing logs automatically. We leverage high level features such as machine/OS type along with more low level kernel level statistics such as I/O rate and system calls. To demonstrate our system we first introduce a configuration bug that was artificially injected in a recently built cluster by changing the TCP buffer size. Next, we present two real-life bugs, one I/O inefficiency bug relating to network transfers on Android, and another misconfiguration bug in VirtualBox, that were identified using our tool.
Keywords
cloud computing; Android; Dropbox; Facebook; Linux; Mac OS; TCP buffer size; Windows; cloud based applications; edge clients; laptops; leveraging cloud infrastructure; network connections; operating systems; smart phones; troubleshooting edge computing systems; Computer bugs; Engines; Monitoring; Performance evaluation; Servers; Smart phones; Software; automated debugging; cloud service; distributed computing; middleware; system calls;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th International Conference on
Conference_Location
Singapore
ISSN
1521-9097
Print_ISBN
978-1-4673-4565-1
Electronic_ISBN
1521-9097
Type
conf
DOI
10.1109/ICPADS.2012.67
Filename
6413665
Link To Document