Data Lake in AWS - Easiest Way to Learn [2024]

Data Lake Mastery: Hands-On Glue, Athena, S3, ETL, Spark, Parquet, QuickSight, Kinesis, Lambda, LLM

4.49 (1684 reviews)
Udemy
platform
English
language
Other
category
instructor
Data Lake in AWS - Easiest Way to Learn [2024]
11,896
students
5.5 hours
content
Mar 2024
last update
$69.99
regular price

What you will learn

Learn about Data Lake vs. Data Warehouse

Key components of a Data Lake Architecture

Query files directly using SQL

Hands-on integration using Kinesis Firehose, Lambda, Comprehend AI, Glue, Athena and S3

Why take this course?

Hello, my name is Chandra Lingam, and I will be your instructor for the Data Lake in AWS course.

In this course, we will begin by gaining an understanding of the fundamental concepts of a data lake and when it is the appropriate solution as opposed to a data warehouse

We will then delve into the various components that make up a data lake solution, including the ability to query files directly using SQL for rapid ad hoc analysis of datasets

During the course, we will cover the topic of handling changes to the structure of the files in the data lake. We will delve into the various scenarios, such as new fields, new partitions, changes in data types, and missing data, and discuss the techniques on how to handle them effectively. We will also delve into Glue Catalog Management and the evolution of schemas, with a focus on minimizing disruption to downstream systems

We will also look into different data formats, such as CSV, Parquet, Avro, and ORC, and examine their respective strengths and weaknesses. Following that, we will delve into Glue ETL, a robust Apache Spark-based solution for data transformation.

This course is filled with hands-on exercises and projects.

You will analyze a University Rankings dataset, which is easy to understand, useful, and has a mix of data types with many data quality issues.

You will learn to utilize Athena for querying data, tackle data quality problems through SQL, and cleanse the data using Glue - Apache Spark ETL.

Additionally, the course covers techniques for simplifying queries using views and visualizing data using Amazon QuickSight.

To showcase the scalability of Athena, we will query the large Amazon Customer Reviews dataset containing over 130 million reviews. Finally, we will construct a serverless application using Kinesis Firehose, Lambda, Comprehend AI, Glue, Athena, and S3, which can process an unlimited number of customer reviews, perform sentiment analysis, and store the results in the data lake for querying.

I am excited to meet you soon!

Thank you!

Chandra Lingam

Compute With Cloud Inc

Screenshots

Data Lake in AWS - Easiest Way to Learn [2024] - Screenshot_01Data Lake in AWS - Easiest Way to Learn [2024] - Screenshot_02Data Lake in AWS - Easiest Way to Learn [2024] - Screenshot_03Data Lake in AWS - Easiest Way to Learn [2024] - Screenshot_04

Reviews

Anil
November 17, 2023
Learnt Glue,Athen , Sagemaker concepts and their importance. Even though I knew what data lake I is understood it very well here. To the point and no extra talk is what I liked.In 5 hours we can learn so much. Thank you very much sir. Explains like a story.
Holger
November 7, 2023
The training is very general and provides a very good overview. A few more details on Glue, Athena and QuickSight would have been better.
Damian
October 6, 2023
The course is definitely a starter for learning how to use AWS services to implement a data lake. The concepts were clearly defined, and I recommend once at 1x and once at 1.25x. If you want an introduction, I highly recommend this course. Its sufficiently clear and the examples work.
Amita
October 4, 2023
I like the instructors tone and pitch and the way he is presenting the complex topics in a superfluous and easy manner.
Ricardo
September 14, 2023
The content was useful and very well explained. However I expected more content about data lake architecture (components, patterns, pros and cons, etc). The labs focus more in AWS Glue and working with s3. Possible the title of the course is misleading.
Ram
August 22, 2023
very good. Awesome lab based tutorial so far. One observation , on lab 56 seems demo for request rate-limit error in s3 error. Please suggest on this, in case I am missing something here please respond. Many thanks .
Khalid
August 7, 2023
I am thoroughly impressed with the content and delivery. This course truly lives up to its name and provides an exceptional learning experience for anyone looking to dive into the world of data lakes on AWS. From the very beginning, the instructor's approach is engaging and approachable. The concepts are explained with remarkable clarity, making complex topics feel much more digestible. The pace of the course is just right. One aspect that sets this course apart is its focus on practicality. The instructor doesn't just talk about concepts in isolation; they provide real-world scenarios and hands-on demonstrations that bring the theory to life. This interactive approach really solidifies the understanding of key concepts and how they are applied in a real AWS environment. The course structure is well thought out, with each section building logically upon the previous one. What's particularly noteworthy is the instructor's responsiveness to questions. Any queries I had were promptly addressed, creating a supportive and dynamic learning environment. This level of engagement truly enhances the overall learning journey. In conclusion, "Data Lake in AWS - Easiest Way to Learn" is an outstanding resource for anyone seeking a comprehensive understanding of data lakes on the AWS platform. Whether you're a beginner or have some experience, this course is designed to meet you where you are and propel you forward. I highly recommend it to anyone looking to gain practical skills and expertise in this domain. Kudos to the instructor for crafting such an enriching course. I'm excited to apply what I've learned to real-world projects and explore the full potential of data lakes in AWS.
César
August 2, 2023
El curso en general me resulto de altísimo nivel, los laboratorios fueron muy entretenidos y didácticos. Por último, no quiero dejar de mencionar la claridad con la que Chandra logra transmitir conceptos.
Gustavo
June 26, 2023
Yes it was what I expected to learn about Data Lake for our transformation projects from Legacy mainframe data to AWS Cloud.
Rahul
June 13, 2023
The explanation is really good, and the structure of course is perfect for any AWS beginner to get started with.
David
March 9, 2023
Really nice high level overview of data lake concepts. Only flaws so far is the deep discussion of the AWS "billing" section which seems out of place for this topic, and there are some dependencies on components like "Glue" that not all users will need. The presenter does a great job at being thorough and explaining things in simple terms.
Oleksandra
February 9, 2023
I love this practical course. Screenshots are outdated but the main idea is clear. Good fundamentals to start to work with databases in AWS!
Alan
February 7, 2023
Precise and to the point material, highly recommended. If you're looking for a quick solution to a particular AWS S3-based ETL architecture problem as I was, this is an excellent resource.
Isaias
January 24, 2023
This introductory course really helps with Data Science and AWS solutions integration. I'm amazed by this course! Congrats!
James
October 30, 2022
The presenter seems to have a strong knowledge of AWS however most ot the lessons are very superficial. I get its supposed to be a course for beginners however I think more details should be given on each top. There very very little actual exercises this would improve the course dramatically

Charts

Price

Data Lake in AWS - Easiest Way to Learn [2024] - Price chart

Rating

Data Lake in AWS - Easiest Way to Learn [2024] - Ratings chart

Enrollment distribution

Data Lake in AWS - Easiest Way to Learn [2024] - Distribution chart

Coupons

DateDiscountStatus
8/29/202080% OFF
expired
3054230
udemy ID
4/26/2020
course created date
5/22/2020
course indexed date
Bot
course submited by