slides.0310.2011 - Geo-distribution in Storage -Jason Croft...

Info iconThis preview shows pages 1–13. Sign up to view the full content.

View Full Document Right Arrow Icon
Geo-distribution in Storage -Jason Croft and Anjali Sridhar
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Outline Introduction Smoke and Mirrors RACS – Redundant Array of Cloud Storage Conclusion 2
Background image of page 2
Introduction Why do we need geo-distribution? Protection against data loss Options for data recovery Cost ? Physical Latency Manpower Power Redundancy/Replication 3
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
How to Minimize Cost ? Smoke and Mirror File System Latency RACS Monetary cost Volley Latency and Monetary cost Applications? 4
Background image of page 4
Smoke and Mirrors: Reflecting Files at a Geographically Remote Location Without Loss of Performance -Hakim Weatherspoon, Lakshmi Ganesh, Tudor Marian, Mahesh Balakrishnan, and Ken Birman, Cornell University, Computer Science Department & Microsoft Research, Silicon Valley ,FAST 2009 5
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Smoke and Mirrors Network sync tries to provide reliable transmission of data from the primary to the replicas with minimum latency Sensitive to high latency but require fault tolerance US Treasury, Finance Sector Technology Consortium and any corporation using transactional databases 6
Background image of page 6
Failure – Sequence or Rolling disaster The model assumes wide area optical link networks with high data rates which has sporadic , bursty packet loss . Experiments are based on observation of TeraGrid, a scientific data network linking supercomputers. 7
Background image of page 7

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Synchronous 1 2 3 4 5 CLIENT Disadvantage - Low performance due to latency Advantage - High reliability 8 PRIMARY Local storage site MIRROR Remote storage site
Background image of page 8
Asynchronous 1 2 4 CLIENT Advantage - High performance due to low latency Disadvantage -Low reliability 3 9 PRIMARY Local storage site MIRROR Remote storage site
Background image of page 9

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Semi-synchronous 1 2 3 4 CLIENT Advantage -Better reliability than asynchronous Disadvantage - More latency than synchronous 10 PRIMARY Local storage site MIRROR Remote storage site
Background image of page 10
Core Ideas Network Sync is close to the semi-synchronous model It uses egress and ingress routers to increase reliability The data packets along with forward error correcting packets are “stored” in the network after which an ack is sent to the client A better bet for applications 11
Background image of page 11

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Network Sync 1 2 3 5 CLIENT PRIMARY Local storage site MIRROR Remote storage site Ingress Router Egress Router Ingress and Egress Routers are gateway routers that form the boundary between the datacenter and the wide area network.
Background image of page 12
Image of page 13
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 52

slides.0310.2011 - Geo-distribution in Storage -Jason Croft...

This preview shows document pages 1 - 13. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online