raid-guestlec

Guest Lecture for 15-440
Disk Array Data Organizations and RAID
October 2010, Greg Ganger ©

Plan for today
  - Why have multiple disks?
    - Storage capacity, performance capacity, reliability
  - Load distribution
    - problem and approaches
    - disk striping
  - Fault tolerance
    - replication
    - parity-based protection
  - “RAID” and the Disk Array Matrix
  - Rebuild
Why multi-disk systems?
  - A single storage device may not provide enough
    - storage capacity, performance capacity, reliability
  - So, what is the simplest arrangement?

Just a bunch of disks (JBOD)
  [Figure: four disks holding blocks A0 A1 A2 A3, B0 B1 B2 B3, C0 C1 C2 C3, and D0 D1 D2 D3]
  - Yes, it’s a goofy name
  - Industry really does sell “JBOD enclosures”
Disk Subsystem Load Balancing
  - I/O requests are almost never evenly distributed
    - Some data is requested more than other data
    - Depends on the apps, usage, time, …

Disk Subsystem Load Balancing
  - I/O requests are almost never evenly distributed
    - Some data is requested more than other data
    - Depends on the apps, usage, time, …
  - What is the right data-to-disk assignment policy?
    - Common approach: Fixed data placement
      - Your data is on disk X, period!
      - For good reasons too: you bought it or you’re paying more …
    - Fancy: Dynamic data placement
      - If some of your files are accessed a lot, the admin (or even the system) may separate the “hot” files across multiple disks
      - In this scenario, entire file systems (or even files) are manually moved by the system admin to specific disks
Disk Subsystem Load Balancing
  - I/O requests are almost never evenly distributed
    - Some data is requested more than other data
    - Depends on the apps, usage, time, …
  - What is the right data-to-disk assignment policy?
    - Common approach: Fixed data placement
      - Your data is on disk X, period!
    - Fancy: Dynamic data placement
      - If some of your files are accessed a lot, we may separate the “hot” files across multiple disks
      - In this scenario, entire file systems (or even files) are manually moved by the system admin to specific disks
    - Alternative: Disk striping
      - Stripe all of the data across all of the disks

Disk Striping
  - Interleave data across multiple disks
    - Large file streaming can enjoy parallel transfers
    - High-throughput requests can enjoy thorough load balancing
      - If blocks of hot files are equally likely on all disks (really?)
  [Figure: File Foo laid out across the disks; labels: "stripe unit or block", "Stripe"]
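
The load-balancing claim can be made concrete with a small sketch. It compares per-disk load when whole files are pinned to disks against load when every file is striped round-robin. The function names, the "file i lives on disk i % N" policy, and the sample workload are all assumptions made for illustration; none of them come from the slides. The sketch also assumes that accesses are spread uniformly over a file's blocks, which is exactly the condition the slide questions with "(really?)".

```python
def fixed_placement_load(accesses_per_file, num_disks):
    """Per-disk load when each whole file lives on one disk.

    Assumed policy (hypothetical): file i is stored on disk i % num_disks.
    """
    load = [0.0] * num_disks
    for file_id, accesses in enumerate(accesses_per_file):
        load[file_id % num_disks] += accesses
    return load


def striped_load(accesses_per_file, blocks_per_file, num_disks):
    """Per-disk load when files are laid out back to back and striped.

    Assumes one block per stripe unit, round-robin placement, and accesses
    spread uniformly over the blocks of each file.
    """
    load = [0.0] * num_disks
    next_block = 0
    for accesses in accesses_per_file:
        per_block = accesses / blocks_per_file
        for block in range(next_block, next_block + blocks_per_file):
            load[block % num_disks] += per_block
        next_block += blocks_per_file
    return load


# Hypothetical workload: one "hot" file and three cold ones on 4 disks.
workload = [1000, 10, 10, 10]
print(fixed_placement_load(workload, 4))
print(striped_load(workload, blocks_per_file=8, num_disks=4))
```

With fixed placement the hot file's disk absorbs nearly all of the requests, while under striping every disk sees the same share. If accesses within the hot file were concentrated on a few blocks, the striped result would be less even.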
Disk striping details
  - How disk striping works
    - Break up total space into fixed-size stripe units
    - Distribute the stripe units among disks in round-robin
    - Compute location of block #B as follows
      - disk# = B % N   (% = modulo, N = # of disks)
      - LBN# = B / N    (integer division; computes the LBN on the given disk)
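
The two formulas above can be sketched as a short function. This is a minimal illustration, assuming a stripe unit of exactly one block and disks numbered from 0; the names `locate_block` and `logical_block` are made up for this example and do not come from the lecture.

```python
from typing import Tuple


def locate_block(block: int, num_disks: int) -> Tuple[int, int]:
    """Map logical block number B to (disk#, LBN# on that disk).

    Assumes one block per stripe unit and round-robin placement.
    """
    if num_disks <= 0:
        raise ValueError("num_disks must be positive")
    if block < 0:
        raise ValueError("block must be non-negative")
    disk = block % num_disks   # disk# = B % N
    lbn = block // num_disks   # LBN# = B / N, using integer division
    return disk, lbn


def logical_block(disk: int, lbn: int, num_disks: int) -> int:
    """Inverse mapping: recover B from (disk#, LBN#)."""
    return lbn * num_disks + disk


# Blocks 0..7 striped over 4 disks.
for b in range(8):
    print(b, locate_block(b, 4))
```

With N = 4, blocks 0 through 3 land on disks 0 through 3 at LBN 0, and blocks 4 through 7 wrap around to the same disks at LBN 1. The inverse function is a quick check that the mapping loses no information.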

Now, What If A Disk Fails?

This note was uploaded on 11/02/2011 for the course CS 440 taught by Professor Anderson during the Spring '11 term at Carnegie Mellon.
