Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9

Use Pentaho Data Integration tool for ETL & Data warehousing. Do ETL development using PDI 9.0 without coding background

4.52 (1734 reviews)
Udemy
platform
English
language
Data & Analytics
category
60,951
students
9.5 hours
content
Jan 2024
last update
$84.99
regular price

What you will learn

Understanding of the entire data integration process using PDI

Extracting data from all popular data sources including Excel, JSON, Zipped files, TXT files and even cloud storage

Cleaning the data using Pentaho Data Integration

Applying business rules on the data in PDI

Different types of Data transformations

Loading the data into different formats

Managing SQL database using PDI

Metadata Injection - a powerful tool offered by PDI

Understanding of the concepts of data marts and data warehouse

Description

What is ETL?

The ETL (extract, transform, load) process is the most popular method of collecting data from multiple sources and loading it into a centralized data warehouse. ETL is an essential component of data warehousing and analytics.

Why Pentaho for ETL?

Pentaho has phenomenal ETL, data analysis, metadata management and reporting capabilities. Pentaho is faster than other ETL tools (including Talend). Pentaho has a user-friendly GUI which is easier and takes less time to learn. Pentaho is great for beginners. Also, Pentaho Data Integration (PDI) is an important skill in data analytics field.

How much can I earn?

In the US, median salary of an ETL developer is $74,835 and in India average salary is Rs. 7,06,902 per year. Accenture, Tata Consultancy Services, Cognizant Technology Solutions, Capgemini, IBM, Infosys etc. are major recruiters for people skilled in ETL tools; Pentaho ETL is one of the most sought-after skills that recruiters look for. Demand for Pentaho Data Integration (PDI) techniques is increasing day after day.

What makes us qualified to teach you?

The course is taught by Abhishek and Pukhraj. Instructors of the course have been teaching Data Science and Machine Learning for over a decade. We have experience in teaching and implementing Pentaho ETL, Pentaho Data Integration (PDI) for data mining and data analysis purposes.

We are also the creators of some of the most popular online courses - with over 150,000 enrollments and thousands of 5-star reviews like these ones:

I had an awesome moment taking this course. It broaden my knowledge more on the power use of Excel as an analytical tools. Kudos to the instructor! - Sikiru

Very insightful, learning very nifty tricks and enough detail to make it stick in your mind. - Armand

Our Promise

Teaching our students is our job and we are committed to it. If you have any questions about the course content on Pentaho, ETL, practice sheet or anything related to any topic, you can always post a question in the course or send us a direct message.

Download Practice files, take Quizzes, and complete Assignments

With each lecture, there is a practice sheet attached for you to follow along. You can also take quizzes to check your understanding of concepts on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Each section contains a practice assignment for you to practically implement your learning on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Solution to Assignment is also shared so that you can review your performance.

By the end of this course, your confidence in using Pentaho ETL and Pentaho Data Integration (PDI) will soar. You'll have a thorough understanding of how to use Pentaho for ETL and Pentaho Data Integration (PDI) techniques for study or as a career opportunity.

Go ahead and click the enroll button, and I'll see you in lesson 1 of this Pentaho ETL course!

Cheers

Start-Tech Academy

Content

Introduction

Welcome to the course
Course Resources

Pentaho Data Integration (PDI) Installation and Setup

Setting up environment and installing PDI
Opening Spoon - The Graphical UI

A Simple ETL Demonstration

The example problem statement
Demonstration of a PDI transformation
Demonstration of a PDI Job

The ETL process: The practical part begins here

Data and the ETL process

DATA EXTRACTION: Extracting tabular data

Manually entering data into PDI
Inputting Data from a TXT (text) file
Input from multiple CSV files at the same time
Inputting Data from an Excel file
Extracting Data from Zipped files

DATA EXTRACTION: Extracting non-tabular data

Extracting from XML
Extracting from JSON

Extracting from an SQL table

Plan for importing sales Data
Creating Sales table in SQL
Extracting from an SQL table

Storing and Retrieving Data from Cloud storage

Storing Data on AWS S3
Reading data from AWS S3

Merging Data Streams

Concepts: Merging Data Streams
Sorted Merge Step

Data Cleansing

Introduction to Data Cleansing
Value Mapper Step
Replace in String Step
Fuzzy Match concepts
Fuzzy Match Step in PDI
Fuzzy Match Algorithms
Formula Step and changing data format
Common Data Cleaning Steps

Data Validation

Introduction to Data validation
Data_validation 1 - String-to-Int and integer range validations
Data validation 2 - Checking Reference Values using stream look-up
Data validation 3 - Order date < shipping date using calculator step
Common Data Validation steps

Error Handling

Correcting the errors and merging with main stream
Writing the errors to the log
Writing the errors to a separate file

Transformation and Analytics steps

Concatenating Address Fields
Data Aggregation using Group-by
Normalization and Denormalization
Number Range Step

Screenshots

Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Screenshot_01Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Screenshot_02Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Screenshot_03Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Screenshot_04

Reviews

Shilpa
October 6, 2023
It was a nice experience going with the course. Especially the demonstration of all the topic and the walkthrough session is really helpful to understand PDI easily.
Jeevan
September 6, 2023
I completed the course and I must say there is nothing like it for Pentaho anywhere else. Thanks to the instructor for making, it so fun and interesting!
Stanislav
August 29, 2023
I truly appreciate the effort that went into preparing this course. It was not only informative and useful, but the deliberate slow pace at which it was presented made it exceptionally accessible and easy to grasp.
Angela
August 11, 2023
Thank you for the clear explanations, and with the subtitles, I know I'll understand this class well!
Mangesh
June 25, 2023
Instructors were good and having good explanation techniques and power. Learned a lot and it will definitely going to help me in number of ways. Thanx a lot. Keep it up.
Charmeet
June 19, 2023
Content was good for someone who is starting with an ETL tool. More content around ETL metadata injection would have better insight into the power of this step.
Donald
June 6, 2023
Everything was fine, but I think the Meta-data injection could provide additional step by step process explanations. The meta-data injection seems like it will be frequently used so, even if some users express the detail as being to much, I think adding full explanations in this part will assist in user's better being able to replicate and build from it. Maybe add a second example of meta-data injection that might be a little more complex to show its versatility and additional functionality.
Luis
May 29, 2023
Magnífico curso, explicaciones claras, ejemplos sencillos para comprender la potencia de Pentaho para poder aplicar el conocimiento adquirido.
David
May 22, 2023
This course covers many of the core elements of Pentaho and gives a good solid foundation of how to develop ETL transformations and jobs using it. The course is backed up with practical examples and course materials to help you follow along and try out examples of your own.
Alessandro
April 21, 2023
Pro: Very simple and clear. Con: Very simple, some smaller mistakes. Strong indian accent I am satisfied with this course
Abdoul
March 16, 2023
Ce cours m'a permis de d'avoir des connaissances sur l'outil ETL ,facile d'exploitation et un puissant outil d'extraction de transformation et de sauvegarde de données.
Mevr.
March 10, 2023
Mijn ervaring tot nu toe is dat zij een aandacht voor de details hebben die out of the box zijn. Ik hoop dat de hele cursus goed wordt en sluit naar mijn verwachtingen. :-)
Sandeep
March 8, 2023
The course was very good from a fresher perspective. The explanation was on point for most of the parts of the course(almost 90%). I had difficulty in understanding the set variables and metadata injection topics, apart from that it was wonderful.
Charles
January 16, 2023
Each and every steps are explained in an easy to understand manner. A right choice for a beginner. Enjoy!
Patricio
December 6, 2022
Gran elección, he aprendido mucho y sobre todo de las funciones que existen para realizar validaciones y limpieza de datos

Coupons

DateDiscountStatus
8/8/2020100% OFF
expired
8/15/2020100% OFF
expired
8/22/2020100% OFF
expired
10/18/2020100% OFF
expired
10/26/2020100% OFF
expired
11/7/2020100% OFF
expired
11/28/2020100% OFF
expired
4/2/2021100% OFF
expired
4/21/2021100% OFF
expired
7/10/2021100% OFF
expired
7/22/2021100% OFF
expired
8/1/2021100% OFF
expired
8/17/2021100% OFF
expired
9/21/2021100% OFF
expired
10/1/2021100% OFF
expired
10/15/2021100% OFF
expired
1/12/2022100% OFF
expired
4/4/2022100% OFF
expired
4/13/2022100% OFF
expired
5/7/2022100% OFF
expired
5/19/2022100% OFF
expired
6/16/2022100% OFF
expired
6/26/2022100% OFF
expired
7/7/2022100% OFF
expired
8/19/2022100% OFF
expired
9/3/2022100% OFF
expired
9/14/2022100% OFF
expired
10/5/2022100% OFF
expired
3/6/2023100% OFF
expired

Charts

Price

Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Price chart

Rating

Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Ratings chart

Enrollment distribution

Pentaho for ETL & Data Integration Masterclass 2024 - PDI 9 - Distribution chart
3333760
udemy ID
7/15/2020
course created date
8/3/2020
course indexed date
Bot
course submited by