L_member.sp11 - CS 525 Advanced Distributed Systems Spring...

CS 525 Advanced Distributed Systems, Spring 2011
Indranil Gupta (Indy)
Membership Protocols (and Failure Detectors)
March 31, 2011
All Slides © IG

Slide 2: Target Settings
  • Process ‘group’-based systems
  • Clouds/Datacenters
  • Replicated servers
  • Distributed databases
  • Crash-stop/Fail-stop process failures
Slide 3: Group Membership Service
[Diagram labels]
  • Application Process pi
  • Application Queries (e.g., gossip, overlays, DHTs, etc.)
  • Group Membership List
  • Membership Protocol: joins, leaves, failures of members
  • Unreliable Communication

Slide 4: Two sub-protocols
[Diagram labels] Application Process pi; pj; Group Membership List; Dissemination; Failure Detector; Unreliable Communication
  • Almost-Complete list (focus of this talk): Gossip-style, SWIM, Virtual synchrony, …
  • Or Partial-random list (other papers): SCAMP, T-MAN, Cyclon, …
Slide 5: Large Group: Scalability A Goal
[Diagram labels]
  • this is us (pi)
  • Process Group “Members”
  • 1000’s of processes
  • Unreliable Communication Network

Slide 6: Group Membership Protocol
[Diagram labels] pi; pj; Unreliable Communication Network
  • I. pj crashed
  • II. Failure Detector: some process finds out quickly
  • III. Dissemination
  • Crash-stop failures only
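The three stages on this slide (I. a crash, II. detection by some process, III. dissemination to the rest of the group) can be sketched in a few lines of Python. This is only an illustrative sketch: the `Member` class and its method names are hypothetical, and dissemination is modeled as a direct notification that always arrives, which the unreliable network on the slide does not guarantee.

```python
class Member:
    """One process in the group, holding its own membership list (hypothetical)."""

    def __init__(self, name, group):
        self.name = name
        self.membership = set(group)

    def on_failure_detected(self, failed, peers):
        # Stage II: this process's failure detector reports that `failed` crashed.
        self.membership.discard(failed)
        # Stage III: dissemination. Assumption: every notification is delivered.
        for peer in peers:
            peer.on_failure_notice(failed)

    def on_failure_notice(self, failed):
        # A peer learns of the failure and updates its own list.
        self.membership.discard(failed)


group = ["pi", "pj", "pk", "pl"]
pi, pk, pl = (Member(n, group) for n in ("pi", "pk", "pl"))

# Stage I: pj crashes (it simply stops participating).
pi.on_failure_detected("pj", peers=[pk, pl])
print(sorted(pk.membership))  # ['pi', 'pk', 'pl']
```

Splitting detection from dissemination in this way means only one process has to notice the crash; the others learn of it second-hand.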
Slide 7: I. pj crashes
  • Nothing we can do about it!
  • A frequent occurrence
  • Common case rather than exception
  • Frequency goes up at least linearly with size of datacenter
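The linear-growth claim can be made concrete with a small calculation. This is a minimal sketch, assuming every machine fails independently with the same mean time between failures; the function name and the numbers are illustrative and do not come from the lecture.

```python
def expected_failures_per_day(num_machines: int, mtbf_days: float) -> float:
    """Expected crashes per day if each machine fails independently
    once every `mtbf_days` days on average (an assumed model)."""
    return num_machines / mtbf_days


# Illustrative numbers only: one failure per machine every 1000 days.
for n in (100, 1000, 10000):
    print(n, expected_failures_per_day(n, 1000.0))
```

Under this model the failure rate scales exactly linearly with the number of machines. The slide's "at least linearly" leaves room for faster growth, which this independent-failure model does not capture.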

Slide 8: II. Distributed Failure Detectors: Desirable Properties
  • Completeness = each failure is detected
  • Accuracy = there is no mistaken detection
  • Speed: time to first detection of a failure
  • Scale: equal load on each member; network message load
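To make completeness and accuracy concrete, here is a minimal sketch of a timeout-based heartbeat detector. It is an assumed design chosen for illustration, not a protocol taken from the slides; the class name, method names, and the use of explicit logical timestamps are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class HeartbeatDetector:
    """Suspects a member once no heartbeat has arrived for `timeout` time units."""

    timeout: float
    last_heard: Dict[str, float] = field(default_factory=dict)

    def heartbeat(self, member: str, now: float) -> None:
        # Record the arrival time of a heartbeat from `member`.
        self.last_heard[member] = now

    def suspected(self, now: float) -> Set[str]:
        # Members whose last heartbeat is older than the timeout.
        return {m for m, t in self.last_heard.items() if now - t > self.timeout}


detector = HeartbeatDetector(timeout=3.0)
detector.heartbeat("pj", now=0.0)
detector.heartbeat("pk", now=0.0)

# pj crashes after t=0; pk keeps sending.
detector.heartbeat("pk", now=2.0)
print(detector.suspected(now=4.0))  # {'pj'}: the crash is detected (completeness)

# pk is alive, but its next heartbeats are lost in the network.
print(sorted(detector.suspected(now=6.0)))  # ['pj', 'pk']: pk is a mistaken detection
```

A crashed process stops sending heartbeats, so it is always suspected eventually. A live process can still be suspected when its messages are lost or delayed, which is exactly the accuracy violation named above.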
Slide 9: Distributed Failure Detectors: Properties
  • Completeness
  • Accuracy
  • Speed: time to first detection of a failure
  • Scale: equal load on each member; network message load
Completeness and Accuracy: impossible together in lossy networks [Chandra and Toueg]. If possible, then can solve consensus!

Slide 10: What Real Failure Detectors Prefer
  • Completeness: guaranteed
  • Accuracy: partial/probabilistic guarantee
  • Speed: time to first detection of a failure
  • Scale: equal load on each member; network message load
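One way to see why accuracy ends up as only a probabilistic guarantee: assume, hypothetically, a heartbeat-style detector that declares a failure after k consecutive missed heartbeats, with each message lost independently with probability p. A live process is then mistakenly suspected in a given window with probability p^k. The model, the function name, and the numbers below are assumptions for illustration, not figures from the lecture.

```python
def false_suspicion_probability(loss_prob: float, misses_needed: int) -> float:
    """Chance that a live process misses `misses_needed` heartbeats in a row,
    assuming each heartbeat is lost independently with probability `loss_prob`."""
    return loss_prob ** misses_needed


# Illustrative loss rate of 10%: waiting for more misses improves accuracy.
for k in (1, 2, 3):
    print(k, false_suspicion_probability(0.1, k))
```

The probability shrinks quickly as k grows but never reaches zero on a lossy network, whereas a crashed process misses every heartbeat and is always detected. The price of a larger k is slower detection.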
Slide 11: Failure Detector Properties
  • Completeness: guaranteed
  • Accuracy: partial/probabilistic guarantee
  • Speed: time to first detection of a failure (time until some process detects the failure)
  • Scale: equal load on each member; network message load

Slide 12: Failure Detector Properties
  • Completeness: guaranteed
  • Accuracy: partial/probabilistic guarantee
  • Speed: time to first detection of a failure (time until some process detects the failure)
  • Scale: equal load on each member; network message load (no bottlenecks/single failure point)
Slide 13: Failure Detector Properties
  • Completeness
  • Accuracy
  • Speed: time to first detection of a failure
  • Scale: equal load on each member; network message load
In spite of arbitrary simultaneous process failures