PySpark - Python Spark Hadoop coding framework & testing

Big data Python Spark PySpark coding framework logging error handling unit testing PyCharm PostgreSQL Hive data pipeline

4.35 (85 reviews)
Udemy
platform
English
language
IT Certification
category
instructor
PySpark - Python Spark Hadoop coding framework & testing
4,488
students
3.5 hours
content
Jan 2024
last update
$49.99
regular price

What you will learn

Python Spark PySpark industry standard coding practices - Logging, Error Handling, reading configuration, unit testing

Building a data pipeline using Hive, Spark and PostgreSQL

Python Spark Hadoop development using PyCharm

Why take this course?

This course will bridge the gap between your academic and real world knowledge and prepare you for an entry level Big Data Python Spark developer role. You will learn the following

  • Python Spark coding best practices

  • Logging

  • Error Handling

  • Reading configuration from properties file

  • Doing development work using PyCharm

  • Using your local environment as a Hadoop Hive environment

  • Reading and writing to a Postgres database using Spark

  • Python unit testing framework

  • Building a data pipeline using Hadoop , Spark and Postgres

Prerequisites :

  • Basic programming skills

  • Basic database knowledge

  • Hadoop entry level knowledge

Screenshots

PySpark - Python Spark Hadoop coding framework & testing - Screenshot_01PySpark - Python Spark Hadoop coding framework & testing - Screenshot_02PySpark - Python Spark Hadoop coding framework & testing - Screenshot_03PySpark - Python Spark Hadoop coding framework & testing - Screenshot_04

Reviews

Thomas
May 15, 2022
The instructor jumps between files too fast, and is hard to track sometimes. I find myself stopping each lesson ever 10-15 seconds just to keep my PyCharm code looking like what I see on the screen. For those not used to PyCharm, some of the shortcuts being used, nice if those were shared, or at least explain what is being done.
Manoj
February 2, 2022
Excellent course but expecting few more topics like Unit testing frameworks and others could have been explained in depth
Francesco
October 27, 2021
I think the content is very useful as it explains real word scenarios. However, I believe it could be improved by following some best practices (e.g. no need to import packages if they are not used)
Khajaasmath
February 22, 2021
This is best course I have ever taken. Best coding standards and framework that I can quickly use for pyspark projects
Divyaansh
February 5, 2021
Course is good covers most of the topics. Would have been even better if there were a section covering cross environment dependency and how the configuration would like along with one small section covering deployment steps.
Umesh
January 27, 2021
I was looking for an end-to-end spark developer process course and I am glad that I finally found one. Thank you...

Charts

Price

PySpark - Python Spark Hadoop coding framework & testing - Price chart

Rating

PySpark - Python Spark Hadoop coding framework & testing - Ratings chart

Enrollment distribution

PySpark - Python Spark Hadoop coding framework & testing - Distribution chart
3616430
udemy ID
11/5/2020
course created date
11/22/2020
course indexed date
Bot
course submited by