Overcoming Common Performance Issues in Apache Spark

Speed up your Spark Scripts and overcome errors

Rating: 4.35 (22 reviews)
Platform: Udemy
Language: English
Category: Databases
Students: 460
Content: 40 mins
Last update: Apr 2023
Regular price: $19.99

What you will learn

The three main causes of performance issues in Apache Spark

How to overcome shuffle-induced performance issues in Apache Spark

How to overcome skew-induced performance issues in Apache Spark

How to overcome spill-induced performance issues in Apache Spark

Why take this course?

Spark is a powerful framework for processing large datasets in parallel, but its complex architecture brings frequent performance issues.


In my experience, it can be frustrating to search everywhere online for a resource that explains the inner workings of Spark clearly enough for you to fully understand them and address these issues. So, I created this course!


This is not a code-along course. It assumes you already know how to code in Spark. Here, we focus on how to resolve the performance issues you encounter during development. We will walk through all of the theory, and you'll come away with actionable steps for resolving your performance issues.


In this course, we will cover:

  • The Apache Spark Architecture

  • The types of deployment modes in Apache Spark

  • The structure of jobs in Apache Spark

  • How to handle the three main performance concerns in Spark

If you don't yet know how to code in Spark, you can join my 60-minute crash course in PySpark, here on Udemy.


Let's get to work understanding why your scripts are not performing as you hoped, and resolve your performance issues together. Shuffle, skew, and spill will be concerns of the past after this course!

Reviews

Bartlomiej
May 10, 2023
Great overview of Spark and the solutions to some common performance issues, would definitely recommend for anyone starting off with spark!

Kenneth
April 20, 2023
Brilliant course, very relevant to my work in Data Analytics (working with large datasets in AWS cloud). The presenter speaks well and is very clear, and the course moves at a good tempo.

Udemy ID: 5274746
Course created date: 4/15/2023
Course indexed date: 4/21/2023
Course submitted by: Bot