Personal_3.MapReduce An Introduction - Map Reduce - Ghana -...

This preview shows page 1 - 9 out of 26 pages.

The preview shows page 7 - 9 out of 26 pages.
Map Reduce- an overview-Ghana
AGENDAComparison of Hadoop with othersystemsRDBMSGrid ComputingMap Reduce- An IntroductionWord count – defaultWord count – custom
AGENDAUnderstanding MapReduceMap Reduce- An IntroductionWord count – defaultWord count – custom
Why not RDBMSSeek time improving slowly than read/writetime
Map ReduceProgramming model to process largedatasetsSupported languages for MRJavaRubyPythonC++
Understanding MapReduceStart with WORDCOUNT example“Do as I say, not as I do”WordCountAs2Do2I2Not2Say1
Understanding MapReducepseudocodedefine wordCount as Map<String,long>;for each document in documentSet {T = tokenize(document);for each token in T {wordCount[token]++;}}
Understanding MapReduce-pseudo codeSpam filterMillions of emailsWord count for analysisWorking from a single computer is timeconsumingRewrite the program to count formmultiple machines

Upload your study docs or become a

Course Hero member to access this document

Upload your study docs or become a

Course Hero member to access this document

End of preview. Want to read all 26 pages?

Upload your study docs or become a

Course Hero member to access this document

Term
Fall
Professor
KERSEY
Tags
Grid Computing, Word Count, Word processor, Hadoop

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture