Data Engineering - Apache Spark

Data Engineering – Apache Spark

Updated: October 14, 2021

About Course
===============
This course is on one of the most popular and widely used Data Engineering frameworks Apache Spark. Many organizations are using this technology to solve their data engineering business problems. This course is about Spark batch aka Spark SQL. You will also learn how distributed parallel processing works. You can use Spark batch for data analytics, ETL, reporting, ingestion and many more use cases.

Apache Spark developers are hugely in demand. Companies are bending recruitment rules to hire developers with such skills.

About Author
===============

The author/trainer of this course is Sandeep Khurana (https://www.linkedin.com/in/skhurana333/) who has 25 years of experience in Java and Big Data related technologies. Sandeep has worked with companies like Yahoo!, IBM, Oracle, Nokia and Intuit.

Sandeep has hosted this course on the Edurigo platform for a cause. All the proceedings collected from the sale of this course will be used by Edurigo to create free content for underprivileged kids.

Course Content
===============

This course consists of 9 modules totalling 7.5 hours of video sessions. It also contains 22 hands-on assignments. Plus a lot of reference material and quizzes.

You also get to join the community of more than 400 developers over WhatsApp and discord channels where you can participate in discussions with the author and other developers.

Who should take this course
============================

Take this course if you want to learn Spark SQL hands-on with examples, whether you are a developer or architect or fresh out of college. Spark code can be written in Scala, Python, Java, and R. The course examples are in Scala which can be ported to any of these languages. You can use Databricks community free edition to practice or you can set up Spark on your local laptop as well.