View the step-by-step solution to:


1) Copy the notebook to your google account.

2) Import the pandas library and alias it as


3) Read in the CSV dataset that is found at the following URL:

4) Print out the shape as well as the first 5 rows of the dataframe.

5) Print out the datatypes of the dataframe columns (dataset features).

6) Print out the summary statistics of the numeric values of your dataset i.e. min, max, mean, standard deviation, etc.

6.1) Describe how you addressed the NaN values and give an explanation justifying your decision.

7) Create scatter plots using Matplotlib. Can you find any interesting relationships in the data? Be sure to label your axis and to give your graphs a title.

Screenshot cool graphs that you create

and share them with the slack channel. 

Don't forget to import matplotlib before trying to use it.

8) STRETCH GOAL (Extra Credit)

Machine Learning algorithms don't do well with categorical values that are represented by strings. In order to have this dataset completely cleaned we need to transform the categorical variables that are represented as strings into numeric categorical variables


Recently Asked Questions

Why Join Course Hero?

Course Hero has all the homework and study help you need to succeed! We’ve got course-specific notes, study guides, and practice tests along with expert tutors.

  • -

    Study Documents

    Find the best study resources around, tagged to your specific courses. Share your own to gain free Course Hero access.

    Browse Documents
  • -

    Question & Answers

    Get one-on-one homework help from our expert tutors—available online 24/7. Ask your own questions or browse existing Q&A threads. Satisfaction guaranteed!

    Ask a Question
Let our 24/7 Object-Oriented Programming tutors help you get unstuck! Ask your first question.
A+ icon
Ask Expert Tutors You can ask You can ask You can ask (will expire )
Answers in as fast as 15 minutes
A+ icon
Ask Expert Tutors