Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera Project Network

Machine Learning with PySpark: Data Analysis using SQL

Coursera Project Network via Coursera

Overview

This Guided Project is for beginning Python Developers. In this 1-hour long project-based course, you will learn how to Describe PySpark and Machine Learning, Use PySpark to Capture data, Use PySpark SQL to observe the data, Use PySpark MLlib to prepare training data, and Use PySpark MLlib to predict an outcome. To achieve this, we will work through using PySpark to read data into a PySpark Dataframe, View the Data using PysPark SQL, Prepare the Test and Training data using a heart disease data set, and attempt to predict heart disease using independent variables.

Syllabus

  • Project Overview
    • This Guided Project is for beginning Python Developers. In this 1-hour long project-based course, you will learn how to Describe PySpark and Machine Learning, Use PySpark to capture data, Use PySpark SQL to observe the data, Use PySpark MLlib to prepare training data, and Use PySpark MLlib to predict an outcome. To achieve this, we will work through using PySpark to read data into a PySpark Dataframe, View the Data using PysPark SQL, Prepare the Test and Training data using a heart disease data set, and attempt to predict heart disease using independent variables.

Taught by

David Dalsveen

Reviews

Start your review of Machine Learning with PySpark: Data Analysis using SQL

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.