Batch Processing with Apache Beam in Python

Easy to follow, hands-on introduction to batch data processing in Python

3.35 (59 reviews)
Udemy
platform
English
language
Data Science
category
instructor
Batch Processing with Apache Beam in Python
276
students
1 hour
content
Sep 2020
last update
$34.99
regular price

What you will learn

Core concepts of the Apache Beam framework

How to design a pipeline in Apache Beam

How to install Apache Beam locally

How to build a real-world ETL pipeline in Apache Beam

How to read and write CSV data from Apache Beam

How to apply built-in and custom transformations on a dataset

How to deploy your pipeline to Cloud Dataflow on Google Cloud

Why take this course?

Apache Beam is an open-source programming model for defining large scale ETL, batch and streaming data processing pipelines. It is used by companies like Google, Discord and PayPal.

In this course you will learn Apache Beam in a practical manner, with every lecture comes a full coding screencast. By the end of the course you'll be able to build your own custom batch data processing pipeline in Apache Beam.

This course includes 20 concise bite-size lectures and a real-life coding project that you can add to your Github portfolio! You're expected to follow the instructor and code along with her.

You will learn:

  • How to install Apache Beam on your machine

  • Basic and advanced Apache Beam concepts

  • How to develop a real-world batch processing pipeline

  • How to define custom transformation steps

  • How to deploy your pipeline on Cloud Dataflow

This course is for all levels. You do not need any previous knowledge of Apache Beam or Cloud Dataflow.

Screenshots

Batch Processing with Apache Beam in Python - Screenshot_01Batch Processing with Apache Beam in Python - Screenshot_02Batch Processing with Apache Beam in Python - Screenshot_03Batch Processing with Apache Beam in Python - Screenshot_04

Reviews

Tristan
November 2, 2022
Beam is a huge and complex framework, so I bought this course hoping to gain some fundamentals. I came away feeling confident and like it was tailored to my skill level at the time. Would 100% recommend as an icebreaker
Pablo
July 21, 2022
Did not go into much detail or provide source code to compare with locally. Authentication area could really use some detail.
Gabriel
March 23, 2022
What an awesome introduction course, nice to have a brief overview of Apache Beam Python SDK and Dataflow. Awesome course, thank you!!
Carlos
November 28, 2021
It would be perfect if she gave us some exercises and the solution as a code file. So we can learn a bit more and interact.
Thiago
October 18, 2020
Very concise, clear e direct introduction to Apache Beam and Google Cloud Dataflow! Alexandra is also very helpful in answering the questions. Thank you so much!
Tom
October 17, 2020
Good course, to the point which is great. Lacking some common 'watch-outs' which would have been helpful. Thanks

Charts

Price

Batch Processing with Apache Beam in Python - Price chart

Rating

Batch Processing with Apache Beam in Python - Ratings chart

Enrollment distribution

Batch Processing with Apache Beam in Python - Distribution chart

Related Topics

3535406
udemy ID
9/29/2020
course created date
10/7/2020
course indexed date
Bot
course submited by