Practical Data Engineering in GCP: Beginner to Advanced

Step by step guide to building four data pipelines in Google Cloud using DataStream, Data Fusion, DataPrep, DataFlow etc

4.00 (79 reviews)
Udemy
platform
English
language
Other
category
instructor
Practical Data Engineering in GCP: Beginner to Advanced
375
students
4.5 hours
content
Mar 2022
last update
$44.99
regular price

What you will learn

How to build No Code/Codeless data pipelines in Google Cloud

You will learn to build real-world data pipelines usings tools like Data Fusion, DataPrep and Dataflow

You will learn to transform data using Data Fusion

You will acquire good data engineering skills in Google Cloud

Working with Big Query Data warehouse in Google Cloud

Why take this course?

In this course, we will be creating a data lake using Google Cloud Storage and bring data warehouse capabilites to the data lake to form the lakehouse architecture using Google BigQuery. We will be building four no code data pipelines using services such as DataStream, Dataflow, DataPrep, Pub/Sub, Data Fusion, Cloud Storage, BigQuery etc.

The course will follow a logical progression of a real world project implementation with hands on experience of setting up  a data lake,  creating data pipelines  for ingestion and transforming your data in preparation for analytics and reporting.


Chapter 1

  • We will setup a project in Google Cloud

  • Introduction to Google Cloud Storage

  • Introduction to Google BigQuery


Chapter 2 - Data Pipeline 1

  • We will create a cloud SQL database and populate with data before we start performing complex ETL jobs.

  • Use DataStream Change Data Capture for streaming data from our Cloud SQL Database into our Data lake built with Cloud Storage

  • Add a pub/sub notification to our bucket

  • Create a Dataflow Pipeline for streaming jobs into BigQuery


Chapter 3 - Data Pipeline 2

  • Introduce Google Data Fusion

  • Author and monitor ETL jobs for tranforming our data and moving them  between different zone of our data lake

  • We will explore the use of Wrangler in Data Fusion for profiling and understanding our data before we starting performing complex ETL jobs.

  • Clean and normalise data

  • Discover and govern data using metadata in Data Fusion


Chapter 4 - Data Pipeline 3

  • Introduction to Google Pub/Sub

  • Building a .Net application for publishing data to a Pub/Sub topic

  • Building a realtime data pipeline for streaming messages to BigQuery


Chapter 5 - Data Pipeline 4

  • Introduction to Cloud DataPrep

  • Profile, Author and monitor ETL jobs for tranforming our data using DataPrep

Screenshots

Practical Data Engineering in GCP: Beginner to Advanced - Screenshot_01Practical Data Engineering in GCP: Beginner to Advanced - Screenshot_02Practical Data Engineering in GCP: Beginner to Advanced - Screenshot_03Practical Data Engineering in GCP: Beginner to Advanced - Screenshot_04

Reviews

Daquan
October 28, 2022
It get's the job done but I feel like the course is very niche. You don't get in depth and advanced understanding of GCP but you also will probably be lost if you don't understand the underlying technologies already. Essentially he walks you through how to setup and use the products. So if you're not a complete beginner and you can already write an query or setup a database but are confused about GCP like I was then this will be good for you.
George
October 24, 2022
não tem os arquivos para importação, vc não consegue acompanhar o andamento do projeto com o professor, é meramente ilustrativo.
Yann
October 13, 2022
Yes for now this course is very good and I can easily follow it. The teacher explanation are so good which gives me a good motivation to going up.
Mousami
February 3, 2022
This course has detailed explanation with real time examples. Instructor has made sure to explain each and every minute detail about the various options inside each resource, which is very nice. I'm looking for a practical example on datastream since long time and this is a golden treat to learn practically. I look forward for more practical lessons on other resources like creating pipelines using Dataproc, Bigtable AI, ML etc.

Charts

Price

Practical Data Engineering in GCP: Beginner to Advanced - Price chart

Rating

Practical Data Engineering in GCP: Beginner to Advanced - Ratings chart

Enrollment distribution

Practical Data Engineering in GCP: Beginner to Advanced - Distribution chart
4512626
udemy ID
1/24/2022
course created date
2/1/2022
course indexed date
Bot
course submited by