Online Course

CS115x: Advanced Apache Spark for Data Science and Data Engineering

University of California, Berkeley via edX

Overview

Gain a deeper understanding of Spark by learning about its APIs, architecture, and common use cases. This statistics and data analysis course covers material relevant to both data engineers and data scientists. You’ll learn how Spark efficiently transfers data across the network via its shuffle, the details of its memory management, optimizations that reduce compute costs, and more. Learners will see several use cases for Spark and will work to solve a variety of real-world problems using public datasets. After taking this course, you should have a thorough understanding of how Spark works and how best to use its APIs to write efficient, scalable code. You’ll also learn about a wide variety of Spark’s APIs, including those in Spark Streaming.

Taught by

Anthony D. Joseph and Jon Bates

Reviews

5.0 rating, based on 1 review
