Building Data Intensive Analytic Applications on Top of Delta Lakes

Databricks via YouTube

Overview

This course covers key data reliability challenges and how Delta Lake brings reliability to data lakes at scale. Participants will learn how Delta Lake fits within an Apache Spark™ environment and how to use it to improve data reliability. It combines instructor-led sessions with hands-on interactive activities. The course is intended for data engineers and practitioners looking to improve data reliability and performance in their organizations.

Syllabus

Introduction
Data Lakes
Typical Data Lake Project
Who uses Delta
Getting started
Data
Download Data
Parquet Table
Stop Streaming
Initializing Streaming
Working with Parquet
Using Delta Lake
Streaming Job
Multiple Streaming Queries
Counting Continuously
Schema Evolution
Merged Schema
Summary
History
Vacuum
Mods
Merge
Update Data
Define DataFrame
Merge Syntax
Random Data
For Each Batch
Summarize
Community
Question
Thank you

Taught by

Databricks
