Ch22a_ParallelDBs-95 - Database Management Systems, 2 nd...

Info iconThis preview shows pages 1–3. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 1 Parallel DBMS Slides by Joe Hellerstein, UCB, with some material from Jim Gray, Microsoft Research. See also: http://www.research.microsoft.com/research/BARC/Gray/PDB95.ppt Chapter 22, Part A Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 2 Why Parallel Access To Data? 1 Terabyte 10 MB/s At 10 MB/s 1.2 days to scan 1 Terabyte 1,000 x parallel 1.5 minute to scan. Parallelism: divide a big problem into many smaller ones to be solved in parallel. B a n d w i d t h Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 3 Parallel DBMS: Intro Y Parallelism is natural to DBMS processing – Pipeline parallelism: many machines each doing one step in a multi-step process. – Partition parallelism: many machines doing the same thing to different pieces of data. – Both are natural in DBMS! Pipeline Partition Any Sequential Program Any Sequential Program Sequential Sequential Sequential Sequential Any Sequential Program Any Sequential Program outputs split N ways, inputs merge M ways Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 4 DBMS: The || Success Story Y DBMSs are the most (only?) successful application of parallelism. – Teradata, Tandem vs. Thinking Machines, KSR.. – Every major DBMS vendor has some || server – Workstation manufacturers now depend on || DB server sales. Y Reasons for success: – Bulk-processing (= partition ||-ism). – Natural pipelining. – Inexpensive hardware can do the trick! – Users/app-programmers don’t need to think in || Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 5 Some || Terminology Y Speed-Up – More resources means proportionally less time for given amount of data. Y Scale-Up – If resources increased in proportion to increase in data size, time is constant. degree of ||-ism X a c t / s e c . ( t h r o u g h p u t ) Ideal degree of ||-ism s e c . / X a c t ( r e s p o n s e t i m e ) Ideal Database Management Systems, 2 nd Edition. Raghu Ramakrishnan and Johannes Gehrke 6 Architecture Issue: Shared What?...
View Full Document

This note was uploaded on 02/06/2010 for the course CSE 302 taught by Professor Joel during the Summer '05 term at Punjab Engineering College.

Page1 / 8

Ch22a_ParallelDBs-95 - Database Management Systems, 2 nd...

This preview shows document pages 1 - 3. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online