HW1_new version.docx - HW 1 Map Reduce Shape Counting\u2014Lecture 3 Name ID MapReduce is the core programming model for the Hadoop Ecosystem Your job is

HW1_new version.docx - HW 1 Map Reduce Shape...

This preview shows page 1 out of 1 page.

HW 1 Map Reduce: Shape Counting—Lecture 3 Name: ID: MapReduce is the core programming model for the Hadoop Ecosystem. Your job is to perform the steps of MapReduce to calculate a count of the number of squares, stars, circles, hearts and triangles in the dataset shown in the picture above. Step 0: Store the dataset across 4 partitions in HDFS. Note: we have already done one partition for you. Hint: Balance the load, but there is more than on possible “correct” partitioning. Step 1: Map the data. Hint: Mapping involves clustering like keys together. Show this in the visual placement of keys within a partition. Step 2: Sort and Shuffle. Note: You don’t have to use the same number of nodes in this step as you did before. Let’s use three instead. Hint: Balance the load.
Image of page 1

You've reached the end of your free preview.

Want to read the whole page?

  • Spring '19
  • Hadoop

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture