• For Input S3 location, type s3://region.elasticmapreduce.samplesReplace regionwith your region identifier.19
Amazon EMR Management GuideView the Results• For Output S3 location, type or browse to the outputbucket that you created in Create anAmazon S3 Bucket (p. 12).• For Action on failure, accept the default option Continue. This specifies that if the step fails, thecluster continues to run and processes subsequent steps. The Cancel and waitoption specifiesthat a failed step should be canceled, that subsequent steps should not run, abut that the clustershould continue running. The Terminate clusteroption specifies that the cluster should terminateif the step fails.5.Choose Add. The step appears in the console with a status of Pending.6.The status of the step changes from Pendingto Runningto Completedas the step runs. To updatethe status, choose the refresh icon to the right of the Filter. The script takes approximately a minuteto run.View the ResultsAfter the step completes successfully, the Hive query output is saved as a text file in the Amazon S3output folder that you specified when you submitted the step.To view the output of the Hive script1.Open the Amazon S3 console at .2.Choose the Bucket nameand then the folder that you set up earlier. For example, mybucketandthen MyHiveQueryResults.3.The query writes results to a folder within your output folder named os_requests. Choose that folder.There should be a single file named 000000_0in the folder. This is a text file that contains your Hivequery results.4.Choose the file, and then choose Downloadto save it locally.5.Use the text editor that you prefer to open the file. The output file shows the number of accessrequests ordered by operating system. The following example shows the output in WordPad:20
Amazon EMR Management GuideStep 5: Clean Up ResourcesStep 5: Terminate the Cluster and Delete theBucketAfter you complete the tutorial, you may want to terminate your cluster and delete your Amazon S3bucket to avoid additional charges.Terminating your cluster terminates the associated Amazon EC2 instances and stops the accrual ofAmazon EMR charges. Amazon EMR preserves metadata information about completed clusters foryour reference, at no charge, for two months. The console does not provide a way to delete terminatedclusters so that they aren't viewable in the console. Terminated clusters are removed from the clusterwhen the metadata is removed.To terminate the cluster1.Open the Amazon EMR console at .2.Choose Clusters, choose your cluster, and then choose Terminate.Clusters are often created with termination protection on, which helps prevent accidental shutdown.
As a current student on this bumpy collegiate pathway, I stumbled upon Course Hero, where I can find study resources for nearly all my courses, get online help from tutors 24/7, and even share my old projects, papers, and lecture notes with other students.
Temple University Fox School of Business ‘17, Course Hero Intern
I cannot even describe how much Course Hero helped me this summer. It’s truly become something I can always rely on and help me. In the end, I was not only able to survive summer classes, but I was able to thrive thanks to Course Hero.
University of Pennsylvania ‘17, Course Hero Intern
The ability to access any university’s resources through Course Hero proved invaluable in my case. I was behind on Tulane coursework and actually used UCLA’s materials to help me move forward and get everything together on time.
Tulane University ‘16, Course Hero Intern
Ask Expert Tutors
You can ask
You can ask ( soon)
You can ask
(will expire )