  • DocumentCode
    1374640
  • Title
    Passive Network Performance Estimation for Large-Scale, Data-Intensive Computing
  • Author
    Kim, Jinoh ; Chandra, Abhishek ; Weissman, Jon B.
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Minnesota, Minneapolis, MN, USA
  • Volume
    22
  • Issue
    8
  • fYear
    2011
  • Firstpage
    1365
  • Lastpage
    1373
  • Abstract
    Distributed computing applications increasingly rely on distributed data sources. However, the unpredictable cost of data access in large-scale computing infrastructures can lead to severe performance bottlenecks. Providing predictable data access is thus essential to accommodate the growing set of large-scale, data-intensive computing applications. In this regard, accurate estimation of network performance is crucial to meeting the performance goals of such applications. Passive estimation based on past measurements is attractive because its overhead is relatively small compared to explicit probing. In this paper, we take a passive approach to network performance estimation. Our approach differs from existing passive techniques, which rely either on past direct measurements between pairs of nodes or on topological similarities. Instead, we exploit secondhand measurements collected by other nodes, without any topological restrictions. We present Overlay Passive Estimation of Network performance (OPEN), a scalable framework that provides end-to-end network performance estimation based on secondhand measurements, and discuss how OPEN achieves cost-effective estimation in a large-scale infrastructure. Our extensive experimental results show that OPEN estimation is applicable to the replica and resource selection commonly used in distributed computing.
  • Keywords
    information retrieval; parallel processing; OPEN; data access; data intensive computing; distributed computing applications; distributed data sources; end-to-end network performance estimation; large-scale computing infrastructure; overlay passive estimation of network performance; passive network performance estimation; performance bottlenecks; Distributed databases; Estimation; Extraterrestrial measurements; Optimization; Peer to peer computing; Servers; Network performance estimation; data-intensive computing; replica selection; resource selection; secondhand estimation
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1045-9219
  • Type
    jour
  • DOI
    10.1109/TPDS.2010.201
  • Filename
    5629337