Introduction to Apache Spark

University of California, Berkeley via edX

This course may be unavailable.


Spark is rapidly becoming the compute engine of choice for big data. Spark programs are more concise and often run 10-100 times faster than Hadoop MapReduce jobs. As companies realize this, Spark developers are becoming increasingly valued.

This statistics and data analysis course will teach you the basics of working with Spark and will provide you with the necessary foundation for diving deeper into Spark. You’ll learn about Spark’s architecture and programming model, including commonly used APIs. After completing this course, you’ll be able to write and debug basic Spark applications. This course will also explain how to use Spark’s web user interface (UI), how to recognize common coding errors, and how to proactively prevent errors. The focus of this course will be Spark Core and Spark SQL.

This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises will use PySpark (the Python API for Spark), but previous experience with Spark or distributed computing is NOT required. Students should take the Python mini-quiz before the course, and take the Python mini-course if they need to learn Python or refresh their Python knowledge.

Taught by

Anthony D. Joseph and Jon Bates


3.6 rating, based on 9 Class Central reviews

  • Caio Taniguchi is taking this course right now, spends 3 hours a week on it, and found the course difficulty to be very easy.

    More of a paid tutorial than an actual course. It's ok to say that the Spark approach is better than the alternatives, but it's just too much. Other than that, the actual contents of the lectures are ok (although shallow), but quizzes are terrible…
  • Sergiy Matusevych completed this course.

  • Stephane Mysona completed this course.

  • Tejas Dharamsi completed this course.

  • Piotr Dziuba completed this course.

  • Alvaro Martin Orive completed this course.

  • Stephane Mysona completed this course.

  • Atila Romero completed this course.

  • Adam Hjerpe completed this course.
