Exploratory Data Analysis in Python

A course about how to approach a dataset for the first time

Rating: 4.30 (262 reviews)
Platform: Udemy
Language: English
Category: Data Science
Students: 6,019
Content: 2 hours
Last update: Oct 2021
Regular price: $59.99

What you will learn

Explore a dataset and calculate overall statistics

Visualize the correlations between the features

Visualize the predictive power of the features

Create useful insights from a dataset

Description

When we get our hands on a dataset for the first time, we can’t wait to test several models and algorithms. This is a mistake: if we don’t understand the data before feeding it to a model, the results will be unreliable and the model is likely to fail. Moreover, if we don’t select the most useful features in advance, training becomes slow and the model may not learn anything useful.

So, the first thing to do is take a look at the dataset and visualize the information it contains. In other words, we have to explore it.

That’s the purpose of Exploratory Data Analysis (EDA).

EDA is an important step in data science and machine learning. It helps us explore the information hidden inside a dataset before applying any model or algorithm. It relies heavily on data visualization and aims to be bias-free.

Moreover, it lets us figure out whether our features have predictive power, and therefore whether the machine learning project we are working on has a chance of succeeding. Without EDA, we may feed the wrong data to a model and never get useful results.
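As a rough illustration of what checking predictive power can look like, the sketch below ranks features by the absolute value of their Pearson correlation with the target. It is not taken from the course material: the toy dataset, the column names (feature_a, feature_b, target) and the helper rank_features_by_correlation are all made up for this example, and linear correlation is only one of several possible measures.

```python
import pandas as pd

# Hypothetical toy dataset: names and values are invented for illustration.
df = pd.DataFrame({
    "feature_a": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "feature_b": [5.0, 1.0, 4.0, 2.0, 6.0, 3.0],
    "target":    [0, 0, 0, 1, 1, 1],
})


def rank_features_by_correlation(data, target_col):
    """Return features sorted by absolute Pearson correlation with the target."""
    features = data.drop(columns=target_col)
    # corrwith computes the correlation of each column with the given Series.
    corr = features.corrwith(data[target_col])
    return corr.abs().sort_values(ascending=False)


ranking = rank_features_by_correlation(df, "target")
print(ranking)
```

In this toy data, feature_a increases together with the target while feature_b does not, so feature_a ends up at the top of the ranking. A low linear correlation does not prove that a feature is useless, which is one reason visual inspection remains part of EDA.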

In this course, students will learn:

  • How to visualize information that is hidden inside the dataset

  • How to visualize the correlation and the importance of the columns of a dataset

  • Some useful Python libraries

All the lessons are practical and use the Python programming language and Jupyter notebooks. All the notebooks are downloadable.
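For a sense of the kind of code such a notebook might contain, here is a minimal sketch that computes overall statistics and a correlation matrix with pandas. The dataset and its column names (height_cm, weight_kg, age) are assumptions made for this example and do not come from the course.

```python
import pandas as pd

# Hypothetical toy dataset: names and values are invented for illustration.
df = pd.DataFrame({
    "height_cm": [150, 160, 170, 180, 190],
    "weight_kg": [50, 58, 69, 80, 92],
    "age":       [40, 25, 33, 51, 29],
})

# Overall statistics per column: count, mean, std, min, quartiles, max.
summary = df.describe()

# Pairwise Pearson correlation between the numeric columns.
corr = df.corr()

print(summary)
print(corr.round(2))
```

In a Jupyter notebook, leaving summary or corr as the last expression of a cell renders it as a table, so the print calls are only needed when running the code as a script.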

Content

Introduction

Introduction to the course
What is EDA?
The dataset
Required Python packages
Jupyter notebooks

Univariate analysis

A first look at our dataset
Summarization
Histograms
Boxplots

Multivariate analysis

Pairplots
Correlation matrix and histograms
Stacked histograms

Some useful libraries

Sweetviz
Pandas profiling

General guidelines

Practical suggestions

Reviews

Juha
October 18, 2023
I'd want more explanation of how to use the information shown by all the graphs. Generating them seems quite straightforward.
Festus
July 5, 2023
In all, it's a nice course but the content should be updated and I think he should use a dataset with lots of rows and also add data cleaning
Dante
March 28, 2023
Good material, it needs an update, it should say the course is going to use only Jupyter, because some things are different in other IDE, those minor differences could be frustrating for beginners
Francesca
January 27, 2023
A short but effective course. Fully satisfied!! Especially for discovering the sweetviz library, which fascinated me!
Mayra
February 1, 2022
It is an ideal course for getting familiar with the Python programming language. It exceeded my expectations, I recommend it!
Veronica
January 4, 2022
Very clear in its explanations, correct and understandable English, lessons carefully designed in both content and form, with no detail overlooked. Recommended!

Charts

Price

Exploratory Data Analysis in Python - Price chart

Rating

Exploratory Data Analysis in Python - Ratings chart

Enrollment distribution

Exploratory Data Analysis in Python - Distribution chart
Udemy ID: 4354856
Course created date: 10/18/2021
Course indexed date: 10/21/2021
Course submitted by: Bot