Udemy

Platform

English

Language

Engineering

Category

Data Manipulation with Pandas Masterclass

Learn the main functions of Pandas for data analysis and visualization in less than 2 hours. Theory and hands-on.

4.42 (24 reviews)

Students

1 hour

Content

May 2021

Last Update
Regular Price


What you will learn

This is a short masterclass in Pandas, the most famous library for data manipulation in Python.

You will learn what Pandas is, and how it can help you load, manage, and transform tabular data.

Learn to analyze real world data using Python & Pandas.

Import data from multiple sources, clean, reshape, impute and visualize your data.

Use Python and Pandas to select, group and summarize your data.

Decide what data to keep and what to ignore.

Create compelling visualizations using Seaborn and Matplotlib.


Description

This masterclass introduces you to concepts and practices for building compelling analyses and dashboards on datasets of any size.  It is designed to be self contained and to be consumed quickly in a single session. It will get you up to speed from zero knowledge of Pandas to understanding how the library operates and using it in several different scenarios.

You will learn:

  • What tabular data is and where you find it

  • How Pandas allows you to load from, and save to, multiple data formats

  • How to use two main components of Pandas: the Series and the DataFrame

  • The main methods to select, group and summarize your data using Pandas

  • How to perform complex operations such as pivot tables and split-apply-combine

  • How to create compelling visualizations using Seaborn and Matplotlib directly from Pandas

The masterclass is designed to maximize the learning experience for everyone and includes 50% theory and 50% hands-on practice. It includes a lab with hands-on exercises and solutions.

No software installation required. You can run the code on Google CoLab and get started right away.

This class is the fastest way to get up to speed in Pandas.

Why Pandas?

Pandas is the most famous data manipulation library and it is used by millions of people every day to analyze and manipulate large datasets. It is mature, robust, easy to use and it has extensive documentation, so it's the perfect entry point for beginners and pros.


Screenshots

Data Manipulation with Pandas Masterclass
Data Manipulation with Pandas Masterclass
Data Manipulation with Pandas Masterclass
Data Manipulation with Pandas Masterclass

Content

Theory

Introduction

Agenda

Tabular Data

Data Manipulation with Pandas

Data Structures

Pandas IO

Selections & Filters

Question: Numpy & Pandas

Question: Indexes

Feature Engineering

Aggregations

Sort & Pivot

Joins

Time Series

Question: in memory

Other Commands

Data Visualization

Lab & Exercises

Lab Start

Lab Part 1

Lab Part 2

Question Python & R

Lab Part 2

Lab Exercise 1 - Prompt

Lab Exercise 1 - Solution

Lab Part 3

Lab Exercise 2


Reviews

t
tee7 May 2021

Excellent content to cover in a high level summary of pandas functionality. The lab exercises were an excellent way to apply what was reviewed in the lectures. Thank you!


3438982

Udemy ID

8/22/2020

Course created date

5/11/2021

Course Indexed date
Angelcrc Seven
Course Submitted by