Introduction to Apache Spark for Developers and Engineers

Basic to intermediate level introduction to Apache Spark that provides the main skills required to use the technology

4.05 (123 reviews)
Udemy
platform
English
language
Development Tools
category
instructor
Introduction to Apache Spark for Developers and Engineers
618
students
3 hours
content
Sep 2015
last update
$19.99
regular price

What you will learn

Identify and understand the concepts of Big Data

Clearly describe Apache Spark

Understand and explain the various components of the Spark framework

Differentiate between Spark and Hadoop MapReduce

Download, install and use Spark on a local machine

Identify and understand the main Scala programming language concepts

Develop basic Spark applications

Explain and use Spark Resilient Distributed Datasets

Why take this course?

What is Apache Spark?

Apache Spark is the next generation open source Big Data processing engine. Spark is designed to provide fast processing of large datasets and high performance for a wide range of applications. Spark enables in-memory cluster computing which greatly improves the speed of iterative algorithms and interactive data mining tasks.

Course Outcomes

'Introduction to Apache Spark' includes illuminating video lectures, practical hands-on Scala and Spark exercises, a guide to local installation of Spark, and quizzes. In this course, we guide students through:

  • An explanation of the Spark framework
  • The basics of programming in Scala, Spark's native language
  • An outline of how to work with Spark's primary abstraction, resilient distributed datasets (RDDs).

Upon completion of the course, students will be able to explain core concepts relating to Spark, understand the fundamentals of coding in Scala, and execute basic programming and data manipulation in Spark. This course will take approximately 8 hours to complete.

Recommended Experience

Programming Languages recommended for this course:

  • Scala (course exercises are in Scala)
  • Java
  • Python

Recommended for:

  • Data scientists and engineers
  • Developers
  • Individuals with a basic understanding of: Apache Hadoop, Big Data, programming languages (Scala, Java, or Python)

For students unfamiliar with Big Data and Hadoop, the course will provide a brief overview of each topic.

Why Adastra Academy?

Adastra Academy is a leading source of training and development for Information Management professionals and individuals interested in Data Management and Analytics technology. Our dedication to identifying and mastering emerging technologies guarantees our students are the first to have access to these quality courses. For an exceptional learning experience, our programs include hands-on labs and real world examples allowing students to easily apply their new knowledge.

Screenshots

Introduction to Apache Spark for Developers and Engineers - Screenshot_01Introduction to Apache Spark for Developers and Engineers - Screenshot_02Introduction to Apache Spark for Developers and Engineers - Screenshot_03Introduction to Apache Spark for Developers and Engineers - Screenshot_04

Reviews

Rebecca
March 20, 2019
basically the lecturer read straight from the notes and wish there was more programming in scala than explanations.
Rustam
October 21, 2018
The content is full of buzz words. This is not what a software engineer needs to hear. I was expecting more technical explanations
Gennadiy
February 10, 2017
This is very valuable course. The difficulties are in hands-on exercises, troubleshooting is tricky if anything goes wrong with the proposed exercises.
Nirav
January 10, 2017
Good basics in Scala & Spark, was hoping to get few more advanced concepts in Scala & Spark. Lectures were well organized & good pace.
Bill
November 20, 2016
The course started out strong but failed miserably in the RDD section by requiring the student to understand concepts for which the proper groundwork was not layed. For example, I found little to no value in the hands on exercises which required me to type in commands for which I had been given no explanation or fundamental understanding of. The only way I would be able to understand the examples I was copying would be to go outside of the course to the Spark and Scala documentation online. This became frustrating to the point that I resigned myself to no relying on the course to teach me more than general concepts to which I would then have to self-educate on outside of the course. This caused me to no longer participate in the hands on or quiz activities in the RDD sections. This course would benefit from restricting hand-on activities and quizzes to Spark and Scala operations which are explained in depth, i.e. this function takes these arguments, performs these operations, and results in this outcome or return value. Then the student would better understand what they are doing and why. Additionally, performing transformations on data, then more transformations, then more... etc. without actually seeing the result of the transformation at each step left me unable to create a mental model of what I was doing. I constantly found myself wanting to dump an RDD to a text file so I could see what I just did to the data. This level of step-by-step, do-something, see-the-result, approach should be built into the course so students get more value and understanding of what they are doing. Seeing a summarized value after I perform 10 steps is next to meaningless to me, I already know I can type the things I see on the page and confirmation that I could do this and get "something" out the other end was about all the value that the hands-on examples provided (specifically in the RDD section).
tee
October 20, 2016
This class provided a solid foundation for learning Spark and Scala. The topics covered were comprehensive for a beginner course. It would have been helpful if the course slides were made available. Also, for some of the more complex practice problems, there were no code solutions provided. This resulted in a less meaningful learning experience when trying to apply the concepts from the lecture.
Pat
July 31, 2016
The course needs better examples and demonstrations of Scala and Spark Scripts. Several independent projects after each course topic would be extremely valuable to developers vice a simple overview which can be found in books and other resources.
Nitin
June 17, 2016
This Module will provide excellent understanding of Hadoop technology and eco-system around it to anyone new to Hadoop environment. Great Work!!
John
March 31, 2016
The course has great and clear definitions as well as featuring a great introduction to big data technologies as a whole.
Karel
February 22, 2016
This is a great course! All the information provided was very useful, accessible and actionable. I am so glad I am stuck it out with this course even though I already had some Spark experience. I strongly recommend this course to everyone who want to all Big Data enthusiasts.
Jesse
November 27, 2015
Scala was barely touched, and the Spark content ended abruptly without connecting the REPL examples with any kind of real setup. The exercises were mostly disappointing. Also, the course has only Windows setup instructions. I ended up using https://github.com/arjones/vagrant-spark-zeppelin with virtualbox.
Zac
November 5, 2015
A great introduction to Apache Spark, especially for those looking to use Scala for programming.The final section on RDDs is quite comprehensive for an introduction.
Irina
August 26, 2015
This course is very valuable for me. It brings my knowledge to the needed level, that I can use Spark in my work, very comprehensive and informative. The materials are presented in understandable and interesting way. This is the best introduction to Spark for me.
Corben
July 2, 2015
i am beginer in spark, for my view, i found the course interessting, that is a good startup, i am french, the teacher was not to fast, and i listen him perfectly, i just regret not more complex sample with real use case in addendum with the usage machine learning, at end course, for analyse and test. i dont see any interaction with hive or pig in hadoop context. i appreciate the install note, very usefull. if you have advanced spark course, cluster standalone deployment, machine learning, advanced feature, hadoop context, i would buy

Charts

Price

Introduction to Apache Spark for Developers and Engineers - Price chart

Rating

Introduction to Apache Spark for Developers and Engineers - Ratings chart

Enrollment distribution

Introduction to Apache Spark for Developers and Engineers - Distribution chart

Related Topics

526136
udemy ID
6/11/2015
course created date
11/21/2019
course indexed date
Bot
course submited by