lect13-clouds

lect13-clouds - Notes ! Research project proposal due Wed....

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon
CMSC 714 Lecture 13 Cloud Computing - MapReduce Alan Sussman 2 CMSC 714 - Alan Sussman Notes ! Research project proposal due Wed. – everyone have a group to work with? – any other project questions? 3 MapReduce ! Both a programming model and a Google implementation for processing large data sets on clusters of commodity computers w/o a fast network – targeted data is mainly Web documents and related data, but could be applied to other domains ! Functional programming model, so processing order does not matter – user writes 2 functions: Map takes an input (key, value) pair and produces a set of intermediate (key, value) pairs Reduce takes a key, and all the corresponding values for the key from the intermediate pairs, and merges the values into a new set of values (sometimes just 1 value) • the intermediate values are given to the function via an iterator (helps when all values for a key don’t fit into memory) CMSC 714 - Alan Sussman 4 MapReduce (cont.) ! Implementation - runtime system does the parallelization
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 2
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 01/12/2012 for the course CMSC 714 taught by Professor Staff during the Fall '07 term at Maryland.

Page1 / 2

lect13-clouds - Notes ! Research project proposal due Wed....

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online