Course Hero Logo

大数据分析 R语言实现_173.docx - Chapter 4 In this section, you've...

Course Hero uses AI to attempt to automatically extract content from documents to surface to you and others so you can study better, e.g., in search results, to enrich docs, and more. This preview shows page 1 out of 1 page.

In this section, you've learned how to create and configure a fully operational Linux virtualmachine with a trial version of Hadoop distribution offered by Hortonworks Sandbox. Wehave also revised several frequently used Linux commands and tools which allow us to obtainbasic information about the deployed infrastructure and to monitor processes and their usageof available resources on the machine.In the following section, we will introduce you to essential Hadoop commands that will enableyou to manage data files within HDFS and perform a simple MapReduce task in Java to obtainword count information, as described in the first part of this chapter.A word count example in Hadoop using JavaEarlier in this chapter, we explained how the HDFS and MapReduce frameworks work bygiving you an example of a very simplified word count task applied to a few randomsentences. In this section, you will implement a similar word count MapReduce job yourself,
End of preview. Want to read the entire page?

Upload your study docs or become a

Course Hero member to access this document

Term
Fall
Professor
wqdwqdwq
Tags
Hadoop, Project Gutenberg, Gutenberg

Newly uploaded documents

Show More

Newly uploaded documents

Show More

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture