The Data Analyst Course: Complete Data Analyst Bootcamp

Complete Data Analyst Training: Python, NumPy, Pandas, Data Collection, Preprocessing, Data Types, Data Visualization

4.55 (15340 reviews)
Udemy
platform
English
language
Data & Analytics
category
instructor
The Data Analyst Course: Complete Data Analyst Bootcamp
113,900
students
21 hours
content
Mar 2024
last update
$139.99
regular price

What you will learn

The course provides the complete preparation you need to become a data analyst

Fill up your resume with in-demand data skills: Python programming, NumPy, pandas, data preparation - data collection, data cleaning, data preprocessing, data visualization; data analysis, data analytics

Acquire a big picture understanding of the data analyst role

Learn beginner and advanced Python

Study mathematics for Python

We will teach you NumPy and pandas, basics and advanced

Be able to work with text files

Understand different data types and their memory usage

Learn how to obtain interesting, real-time information from an API with a simple script

Clean data with pandas Series and DataFrames

Complete a data cleaning exercise on absenteeism rate

Expand your knowledge of NumPy – statistics and preprocessing

Go through a complete loan data case study and apply your NumPy skills

Master data visualization

Learn how to create pie, bar, line, area, histogram, scatter, regression, and combo charts

Engage with coding exercises that will prepare you for the job

Practice with real-world data

Solve a final capstone project

Why take this course?

The problem

Most data analyst, data science, and coding courses miss a critical practical step. They don’t teach you how to work with raw data, how to clean, and preprocess it. This creates a sizeable gap between the skills you need on the job and the abilities you have acquired in training. Truth be told, real-world data is messy, so you need to know how to overcome this obstacle to become an independent data professional.

The bootcamps we have seen online and even live classes neglect this aspect and show you how to work with ‘clean’ data. But this isn’t doing you a favour. In reality, it will set you back both when you are applying for jobs, and when you’re on the job.

The solution

Our goal is to provide you with complete preparation. And this course will turn you into a job-ready data analyst. To take you there, we will cover the following fundamental topics extensively.

  • Theory about the field of data analytics

  • Basic Python

  • Advanced Python

  • NumPy

  • Pandas

  • Working with text files

  • Data collection

  • Data cleaning

  • Data preprocessing

  • Data visualization

  • Final practical example

Each of these subjects builds on the previous ones. And this is precisely what makes our curriculum so valuable. Everything is shown in the right order and we guarantee that you are not going to get lost along the way, as we have provided all necessary steps in video (not a single one skipped). In other words, we are not going to teach you how to analyse data before you know how to gather and clean it.

So, to prepare you for the entry-level job that leads to a data science position - data analyst - we created The Data Analyst Course.

This is a rather unique training program because it teaches the fundamentals you need on the job. A frequently neglected aspect of vital importance.

Moreover, our focus is to teach topics that flow smoothly and complement each other. The course provides complete preparation for someone who wants to become a data analyst at a fraction of the cost of traditional programs (not to mention the amount of time you will save). We believe that this resource will significantly boost your chances of landing a job, as it will prepare you for practical tasks and concepts that are frequently included in interviews.

The topics we will cover

1. Theory about the field of data analytics

2. Basic Python

3. Advanced Python

4. NumPy

5. Pandas

6. Working with text files

7. Data collection

8. Data cleaning

9. Data preprocessing

10. Data visualization

11. Final practical example


1. Theory about the field of data analytics

Here we will focus on the big picture. But don’t imagine long boring pages with terms you’ll have to check up in a dictionary every minute. Instead, this is where we want to define who a data analyst is, what they do, and how they create value for an organization.

Why learn it?

You need a general understanding to appreciate how every part of the course fits in with the rest of the content. As they say, if you know where you are going, chances are that you will eventually get there. And since data analyst and other data jobs are relatively new and constantly evolving, we want to provide you with a good grasp of the data analyst role specifically. Then, in the following chapters, we will teach you the actual tools you need to become a data analyst.

2. Basic Python

This course is centred around Python. So, we’ll start from the very basics. Don’t be afraid if you do not have prior programming experience.

Why learn it?

You need to learn a programming language to take full advantage of the data-rich world we live in. Unless you are equipped with such a skill, you will always be dependent on other people’s ability to extract and manipulate data, and you want to be independent while doing analysis, right? Also, you don’t necessarily need to learn many programming languages at once. It is enough to be very skilled at just one, and we’ve naturally chosen Python which has established itself as the number one language for data analysis and data science (thanks to its rich libraries and versatility).

3. Advanced Python

We will introduce advanced Python topics such as working with text data and using tools such as list comprehensions and anonymous functions.

Why learn it?

These lessons will turn you into a proficient Python user who is independent on the job. You will be able to use Python’s core strengths to your advantage. So, here it is not just about the topics, it is also about the depth in which we explore the most relevant Python tools.

4. NumPy

NumPy is Python’s fundamental package for scientific computing. It has established itself as the go-to tool when you need to compute mathematical and statical operations.

Why learn it?

A large portion of a data analyst’s work is dedicated to preprocessing datasets. Unquestionably, this involves tons of mathematical and statistical techniques that NumPy is renowned for. In addition, the package introduces multi-dimensional array structures and provides a plethora of built-in functions and methods to use while working with them. In other words, NumPy can be described as a computationally stable state-of-the-art Python instrument that provides flexibility and can take your analysis to the next level.

5. Pandas

The pandas library is one of the most popular Python tools that facilitate data manipulation and analysis. It is very valuable because you can use it to manipulate all sorts of information - numerical tables and time series data, as well as text.

Why learn it?

Pandas is the other main tool an analyst needs to clean and preprocess the data they are working with. Its data manipulation features are second to none in Python because of the diversity and richness it provides in terms of methods and functions. The combined ability to work with both NumPy and pandas is extremely powerful as the two libraries complement each other. You need to be capable to operate with both to produce a complete and consistent analysis independently.

6. Working with text files

Exchanging information with text files is practically how we exchange information today. In this part of the course, we will use the Python, pandas, and NumPy tools learned earlier to give you the essentials you need when importing or saving data.

Why learn it?

In many courses, you are just given a dataset to practice your analytical and programming skills. However, we don’t want to close our eyes to reality, where converting a raw dataset from an external file into a workable Python format can be a massive challenge.

7. Data collection

In the real world, you don’t always have the data readily available for you. In this part of the course, you will learn how to retrieve data from an API.

Why learn it?

You need to know how to source your data, right? To be a well-rounded analyst you must be able to collect data from outside sources. This is rarely a one-click process. This section aims at providing you with all the necessary tools to do that on your own.

8. Data cleaning

The next logical step is to clean your data. This is where you will apply the pandas skills acquired earlier in practice. All lessons throughout the course have a real-world perspective.

Why learn it?

A large part of a data analyst’s job in the real world involves cleaning data and preparing it for the actual analysis. You can’t expect that you’ll deal with flawless data sources, right? So, it will be up to you to overcome this stage and clean your data.

9. Data preprocessing

Even when your dataset is clean and in an understandable shape, it isn’t quite ready to be processed for visualizations and analysis just yet. There is a crucial step in between, and that’s data preprocessing.

Why learn it?

Data preprocessing is where a data analyst can demonstrate how good or great they are at their job. This stage of the work requires the ability to choose the right statistical tool that will improve the quality of your dataset and the knowledge to implement it with advanced pandas and NumPy techniques. Only when you’ve completed this step can you say that your dataset is preprocessed and ready for the next part, which is data visualization.

10. Data visualization

Data visualization is the face of data. Many people look at the data and see nothing. The reason for that is that they are not creating good visualizations. Or even worse – they are creating nice graphs but cannot interpret them accurately.

Why learn it?

This part of the course will teach you how to use your data to produce meaningful insights. At the end of the day, data charts are what conveys the most information in the shortest amount of time. And nothing speaks better than a well crafted and meaningful data visualization.

11. Practical example

The course contains plenty of exercises and practical cases. In the end, we have included a comprehensive practical example that will show you how everything you have learned along the way comes nicely together. This is where you will be able to appreciate how far you have come in your journey to becoming a data analyst and starting your data career.

What you get

  • A program worth $1,250

  • Active Q&A support

  • All the knowledge to become a data analyst

  • A community of aspiring data analysts

  • A certificate of completion

  • Access to frequent future updates

  • Real-world training

  • Get ready to become a data analyst from scratch

Why wait? Every day is a missed opportunity.

Click the “Buy Now” button and become a part of our data analyst program today.

Content

Introduction to the Course

A Practical Example - What Will You Learn in This Course?
What Does the Course Cover?

Introduction to Data Analytics

Introduction to the World of Business and Data
Relevant Terms Explained
Data Analyst Compared to Other Data Jobs
Data Analyst Job Description
Why Python

Setting up the Environment

Introduction
Programming Explained in a Few Minutes
Programming Explained in a Few Minutes
Jupyter - Introduction
Jupyter - Installing Anaconda
Jupyter - Intro to Using Jupyter
Jupyter - Working with Notebook Files
Jupyter - Using Shortcuts
Jupyter - Handling Error Messages
Jupyter - Restarting the Kernel
Jupyter - Introduction

Python Basics

Python Variables
Python Variables
Types of Data - Numbers and Boolean Values
Types of Data - Numbers and Boolean Values
Types of Data - Strings
Types of Data - Strings
Basic Python Syntax - Arithmetic Operators
Basic Python Syntax - Arithmetic Operators
Basic Python Syntax - The Double Equality Sign
Basic Python Syntax - The Double Equality Sign
Basic Python Syntax - Reassign Values
Basic Python Syntax - Reassign Values
Basic Python Syntax - Add Comments
Basic Python Syntax - Add Comments
Basic Python Syntax - Line Continuation
Basic Python Syntax - Indexing Elements
Basic Python Syntax - Indexing Elements
Basic Python Syntax - Indentation
Basic Python Syntax - Indentation
Operators - Comparison Operators
Operators - Comparison Operators
Operators - Logical and Identity Operators
Operators - Logical and Identity Operators
Conditional Statements - The IF Statement
Conditional Statements - The IF Statement
Conditional Statements - The ELSE Statement
Conditional Statements - The ELIF Statement
Conditional Statements - A Note on Boolean Values
Conditional Statements - A Note on Boolean Values
Functions - Defining a Function in Python
Functions - Creating a Function with a Parameter
Functions - Another Way to Define a Function
Functions - Using a Function in Another Function
Functions - Combining Conditional Statements and Functions
Functions - Creating Functions That Contain a Few Arguments
Functions - Notable Built-in Functions in Python
Functions
Sequences - Lists
Sequences - Lists
Sequences - Using Methods
Sequences - Using Methods
Sequences - List Slicing
Sequences - Tuples
Sequences - Dictionaries
Sequences - Dictionaries
Iteration - For Loops
Iteration - For Loops
Iteration - While Loops and Incrementing
Iteration - Create Lists with the range() Function
Iteration - Create Lists with the range() Function
Iteration - Use Conditional Statements and Loops Together
Iteration - Conditional Statements, Functions, and Loops
Iteration - Iterating over Dictionaries

Fundamentals for Coding in Python

Object-Oriented Programming (OOP)
Modules, Packages, and the Python Standard Library
Importing Modules
Introduction to Using NumPy and pandas
What is Software Documentation?
The Python Documentation

Mathematics for Python

What Is а Matrix?
Scalars and Vectors
Linear Algebra and Geometry
Arrays in Python
What Is a Tensor?
Adding and Subtracting Matrices
Errors When Adding Matrices
Transpose
Dot Product of Vectors
Dot Product of Matrices
Why is Linear Algebra Useful

NumPy Basics

The NumPy Package and Why We Use It
Installing/Upgrading NumPy
Ndarray
The NumPy Documentation
NumPy Basics - Exercise

Pandas - Basics

Introduction to the pandas Library
Installing and Running pandas
Introduction to pandas Series
Working with Attributes in Python
Using an Index in pandas
Label-based vs Position-based Indexing
More on Working with Indices in Python
Using Methods in Python - Part I
Using Methods in Python - Part II
Parameters vs Arguments
the pandas Documentation
Introduction to pandas DataFrames
Creating DataFrames from Scratch - Part I
Creating DataFrames from Scratch - Part II
Additional Notes on Using DataFrames
pandas Basics - Conclusion

Working with Text Files

Working with Files in Python - An Introduction
File vs File Object, Read vs Parse
Structured vs Semi-Structured and Unstructured Data
Data Connectivity through Text Files
Principles of Importing Data in Python
More on Text Files (*.txt vs *.csv)
Fixed-width Files
Common Naming Conventions Used in Programming
Importing Text Files in Python ( open() )
Importing Text Files in Python ( with open() )
Importing *.csv Files with pandas - Part I
Importing *.csv Files with pandas - Part II
Importing *.csv Files with pandas - Part III
Importing Data with the "index_col" Parameter
Importing Data with NumPy - .loadtxt() vs genfromtxt()
Importing Data with NumPy - Partial Cleaning While Importing
Importing Data with NumPy - Exercise
Importing *.json Files
Prelude to Working with Excel Files in Python
Working with Excel Data (the *.xlsx Format)
An Important Exercise on Importing Data in Python
Importing Data with the pandas' "Squeeze" Parameter
A Note on Importing Files in Jupyter
Saving Your Data with pandas
Saving Your Data with NumPy - np.save()
Saving Your Data with NumPy - np.savez()
Saving Your Data with NumPy - np.savetxt()
Saving Your Data with NumPy - Exercise
Working with Text Files - Conclusion

Working with Text Data

Using the .format() Method

Must-Know Python Tools

Iterating Over Range Objects
Nested For Loops - Introduction
Triple Nested For Loops
List Comprehensions
Anonymous (Lambda) Functions

Data Gathering/Data Collection

What is data gathering/data collection?

APIs (POST requests are not needed for this course)

Overview of APIs
GET and POST Requests
Data Exchange Format for APIs: JSON
Introducing the Exchange Rates API
Including Parameters in a GET Request
More Functionalities of the Exchange Rates API
Coding a Simple Currency Conversion Calculator
iTunes API
iTunes API: Homework
iTunes API: Structuring and Exporting the Data
Pagination: GitHub API
APIs: Exercise

Data Cleaning and Data Preprocessing

Data Cleaning and Data Preprocessing

pandas Series

.unique(), .nunique()
Converting Series into Arrays
.sort_values()
Attribute and Method Chaining
.sort_index()

NumPy Fundamentals

Indexing in NumPy
Assigning Values in NumPy
Elementwise Properties of Arrays
Types of Data Supported by NumPy
Characteristics of NumPy Functions Part 1
Characteristics of NumPy Functions Part 2
NumPy Fundamentals - Exercise

NumPy DataTypes

ndarrays
Arrays vs Lists
Strings vs Object vs Number
NumPy DataTypes - Exercise

Working with Arrays

Basic Slicing in NumPy
Stepwise Slicing in NumPy
Conditional Slicing in NumPy
Dimensions and the Squeeze Function
Working with Arrays - Exercise

Generating Data with NumPy

Arrays of 0s and 1s
"_like" functions in NumPy
A Non-Random Sequence of Numbers
Random Generators and Seeds
Basic Random Functions in NumPy
Probability Distributions in NumPy
Applications of Random Data in NumPy
Generating Data with NumPy - Exercise

Statistics with NumPy

Using Statistical Functions in NumPy
Minimal and Maximal Values in NumPy
Statistical Order Functions in NumPy
Averages and Variance in NumPy
Covariance and Correlation in NumPy
Histograms in NumPy (Part 1)
Histograms in NumPy (Part 2)
NAN Equivalent Functions in NumPy
Statistics with NumPy - Exercise

NumPy - Preprocessing

Checking for Missing Values in Ndarrays
Substituting Missing Values in Ndarrays
Reshaping Ndarrays
Removing Values from Ndarrays
Sorting Ndarrays
Argument Sort in NumPy
Argument Where in NumPy
Shuffling Ndarrays
Casting Ndarrays
Striping Values from Ndarrays
Stacking Ndarrays
Concatenating Ndarrays
Finding Unique Values in Ndarrays

The "Absenteeism" Exercise - Introduction

An Introduction to the "Absenteeism" Exercise
The "Absenteeism" Exercise from a Business Perspective
The Dataset

Solution to the "Absenteeism" Exercise

How to Complete the Absenteeism Exercise
Eyeball Your Data First
Note: Programming vs the Rest of the World
Using a Statistical Approach to Solve Our Exercise
Dropping the 'ID' Column
Analysis of the 'Reason for Absence' Column
Splitting the Reasons for Absence into Multiple Dummy Variables
Working with Dummy Variables - A Statistical Perspective
Grouping the Reason for Absence Columns
Concatenating Columns in a pandas DataFrame
Reordering Columns in a DataFrame
Working on the 'Date' Column
Extracting the Month Value from the 'Date' Column
Creating the 'Day of the Week' Column
Understanding the Meaning of 5 More Columns
Modifying the 'Education' Column
Final Remarks on the Absenteeism Exercise

Data Visualization

What Is Data Visualization and Why Is It Important?
Why Learn Data Visualization?
Choosing the Right Visualization – What Are Some Popular Approaches and Framewor
Introduction into Colors and Color Theory
Bar Chart - Introduction - General Theory and Getting to Know the Dataset
Bar Chart - How to Create a Bar Chart Using Python
Bar Chart – Interpreting the Bar Graph. How to Make a Good Bar Graph
Pie Chart - Introduction - General Theory and Dataset
Pie Chart - How to Create a Pie Chart Using Python
Pie Chart – Interpreting the Pie Chart
Pie Chart - Why You Should Never Create a Pie Graph
Stacked Area Chart - Introduction - General Theory. Getting to Know the Dataset
Stacked Area Chart - How to Create a Stacked Area Chart Using Python
Stacked Area Chart - Interpreting the Stacked Area Graph
Stacked Area Chart - How to Make a Good Stacked Area Chart
Line Chart - Introduction - General Theory. Getting to Know the Dataset
Line Chart - How to Create a Line Chart in Python
Line Chart - Interpretation
Line Chart - How to Make a Good Line Chart
Histogram - Introduction - General Theory. Getting to Know the Dataset
Histogram - How to Create a Histogram Using Python
Histogram – Interpreting the Histogram
Histogram – Choosing the Number of Bins in a Histogram
Histogram - How to Make a Good Histogram
Scatter Plot - Introduction - General Theory. Getting to Know the Dataset
Scatter Plot - How to Create a Scatter Plot Using Python
Scatter Plot – Interpreting the Scatter Plot
Scatter Plot - How to Make a Good Scatter Plot
Regression Plot - Introduction - General Theory. Getting to Know the Dataset
Regression Plot - How to Create a Regression Scatter Plot Using Python
Regression Plot – Interpreting the Regression Scatter Plot
Regression Plot - How to Make a Good Regression Plot
Bar and Line Chart - Introduction - General Theory. Getting to Know the Dataset
Bar and Line Chart - How to Create a Combination Bar and Line Graph Using Python
Bar and Line Chart – Interpreting the Combination Bar and Line Graph
Bar and Line Chart – How to Make a Good Bar and Line Graph
Data Visualization - Exercise

Conclusion

Conclusion

Screenshots

The Data Analyst Course: Complete Data Analyst Bootcamp - Screenshot_01The Data Analyst Course: Complete Data Analyst Bootcamp - Screenshot_02The Data Analyst Course: Complete Data Analyst Bootcamp - Screenshot_03The Data Analyst Course: Complete Data Analyst Bootcamp - Screenshot_04

Reviews

Grant
November 10, 2023
The content I've learned is helpful, but these guys DO NOT update their videos, exercises, code etc. A lot of the content is outdated and it gets very tiring relying on other students to find the right answers to anything. Proceed with caution.
Wei
November 10, 2023
You can learn the full concepts and coding skills of NumPy and Pandas. The two important modules in data preprocessing.
antony
November 8, 2023
i cnt work with the coding exercise number 14. But all the possible ways but still it indicates error
Pankhuri
November 4, 2023
Data science And Course on Python were more good then this one, although this is good too, but they were very well explained
DAVID
November 3, 2023
The course was good, however it needs to be updated because several attributes/formulas/formatting have been modified in the latest versions, so the code showcased in the videos didn't always work.
Brett
November 1, 2023
Covers good content, but the instructors cover concepts far too quickly. It is advisable to become familiar with the basics of Python before doing this course. Even though there are exercises to practice the concepts you've just seen, not everything is as intuitive as the instructors make it seem in the videos. A basic understanding of Python will help A LOT before doing this course.
Devansh
October 31, 2023
It was a great experience and allowed me to develop my python abilities as well as refresh some prior knowledge. Would highly recommend to others although the IDE, is sometimes a little buggy.
Tarun
October 29, 2023
My experience is good as of now, the teaching method is good with the practical approach with the help of quizzes.
Joshua
October 27, 2023
I am now appreciating the value of this lesson as compared to what and how it was taught in the classroom
Ricardo
October 27, 2023
El curso es Excelente, muy bien explicado ! Algunas cosas ya no están vigentes pero nada que una googleada rapida no pueda resolver ! esto contribuye a nuestra habilidad para googlear supongo !!! haha ! Excelente curso, recomendado !
Jacqueline
October 25, 2023
I like how analysis and analytics is explained. Now I have a better understanding of the two methodologies. Thanks.
Jason
October 25, 2023
Sometimes even if you directly copy the solution from solutions manual the exercise will still fail. For example, Exercise 60. This is a function designed to test one argument, but the instructions want you to call it for two different arguments... thus there is no way to correctly enter calling the function twice with two different arguments that will be recorded as successfully completing the "Run tests" option. You can call it for one of the arguments they request or the other and that isn't correct ... or you can try to call it for both and that isn't correct.
Renata
October 24, 2023
I loved the python portion of it! Vey easy to understanding but, the modules on APIS and Pandas Series are outdated! This should be fixed ASAP, I spent a lot of time trying to figure out how it was suppose to be, because the lecture didn't explain the update way.
Jonathan
October 24, 2023
Taught me well on how to code in Python and deal with Data Visualization. At the moment though, I would say that it has one flaw and that's the API section that no longer works in 2023. Q&A section notes this. At the very least, they have a video that actually takes care of that one section. The rest of it is very informative to a person looking to look at the basics of how to be a Data Analyst. No shortcuts or special applications. You learn from ground 0 to principles and much more.
Chastan
October 21, 2023
Fantastic! The information is presented in a very easy to digest method. In depth description at a pace that doesn't overwhelm you.

Coupons

DateDiscountStatus
10/27/202095% OFF
expired
1/28/2021100% OFF
expired
6/24/202194% OFF
expired

Charts

Price

The Data Analyst Course: Complete Data Analyst Bootcamp - Price chart

Rating

The Data Analyst Course: Complete Data Analyst Bootcamp - Ratings chart

Enrollment distribution

The Data Analyst Course: Complete Data Analyst Bootcamp - Distribution chart

Related Topics

3570337
udemy ID
10/15/2020
course created date
10/27/2020
course indexed date
Bot
course submited by