00_Python_EDA.docx - Exploratory Data Analytics What is EDA...

This preview shows page 1 - 3 out of 5 pages.

Exploratory Data Analytics What is EDA? EDA means Exploration of Data for Analysis Used to analyze a dataset features/attributes to summarize its key characteristics What can data tell us quickly so that we can form some hypothesis What key characteristics? 5 number summary i.e. min, 25th percentile, median, 75th percentile and max Other basic statistics like average, standard deviation Understand how the data is distributed over various parameters Data distribution is presented visually using graphs and charts What will we do in this EDA exercise? #movies, #ratings, #users Genre distribution as a pie chart 5-point summary of the rating attribute Rating distribution as a histogram Top ranked movies Find awesome masala movies to watch Python Basics Importing numpy, pandas, matplotlib and seaborn in python import numpy as np import pandas as pd import seaborn as sns import matplotlib.pyplot as plt %matplotlib inline from scipy import stats slope, intercept, r_value, p_value, std_err = stats.linregress(df[‘height’], df[‘weight’]) slope = 16.783524424282902 , intercept = -37.45428562014031
Image of page 1
EDA using Python We will work on our Movielens dataset using the "Pandas" package. Pandas makes working with Tabular data very easy as we will see import pandas as pd Read the movies.csv file and create a Pandas DataFrame called movies_df movies_df = pd.read_csv('13_Movies.csv') Now let’s peek into this data frame object using its head function movies_df.head() Now let’s see what shape is, i.e., number of rows and number of columns in the DataFrame
Image of page 2
Image of page 3

You've reached the end of your free preview.

Want to read all 5 pages?

  • Fall '19

What students are saying

  • Left Quote Icon

    As a current student on this bumpy collegiate pathway, I stumbled upon Course Hero, where I can find study resources for nearly all my courses, get online help from tutors 24/7, and even share my old projects, papers, and lecture notes with other students.

    Student Picture

    Kiran Temple University Fox School of Business ‘17, Course Hero Intern

  • Left Quote Icon

    I cannot even describe how much Course Hero helped me this summer. It’s truly become something I can always rely on and help me. In the end, I was not only able to survive summer classes, but I was able to thrive thanks to Course Hero.

    Student Picture

    Dana University of Pennsylvania ‘17, Course Hero Intern

  • Left Quote Icon

    The ability to access any university’s resources through Course Hero proved invaluable in my case. I was behind on Tulane coursework and actually used UCLA’s materials to help me move forward and get everything together on time.

    Student Picture

    Jill Tulane University ‘16, Course Hero Intern

Stuck? We have tutors online 24/7 who can help you get unstuck.
A+ icon
Ask Expert Tutors You can ask You can ask ( soon) You can ask (will expire )
Answers in as fast as 15 minutes
A+ icon
Ask Expert Tutors