Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

LinkedIn Learning

Apache Spark Essential Training

via LinkedIn Learning

Overview

Prepare for a new career with $100 off Coursera Plus
Gear up for jobs in high-demand fields: data analytics, digital marketing, and more.
Get up to speed with Spark, and discover how to leverage this powerful platform to efficiently and effectively work with big data.

Syllabus

Introduction
  • Welcome
  • What you should know before watching this course
  • Using the exercise files
1. Introducing Apache Spark
  • Understanding Spark
  • Origins of Spark
  • Overview of Spark components
  • Where Spark shines
  • Overview of Databricks
  • Introduction to notebooks and PySpark
2. Analyzing Data in Spark
  • Understanding data interfaces
  • Working with text files
  • Loading CSV data into DataFrames
  • Exploring data in DataFrames
  • Saving your results
3. Using Spark SQL to Analyze Data
  • Creating tables
  • Querying data with Spark SQL
  • Visualizing data in Databricks notebooks
4. Running Machine Learning Algorithms Using MLlib
  • Introduction to machine learning with Spark
  • Preparing data for machine learning
  • Building a linear regression model
  • Evaluating a linear regression model
  • Visualizing a linear regression model
5. Real-Time Data Analysis with Spark Streaming
  • Introduction to streaming analytics
  • Streaming context setup
  • Querying streaming data
6. Connecting BI Tools to Spark
  • Setting up spark locally
  • Connecting Jupyter notebooks to Spark
  • Other connection options
Conclusion
  • Next steps

Taught by

Ben Sullins

Reviews

4.6 rating at LinkedIn Learning based on 469 ratings

Start your review of Apache Spark Essential Training

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.