Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Linux Foundation

An Intro to Presto, the Open Source Distributed SQL Query Engine for the Data Lake

Linux Foundation via YouTube

Overview

This course provides an introduction to Presto, the open-source distributed SQL query engine for the data lake. By the end of the course, learners will understand the fundamentals of Presto technology, its growing popularity, and how to get started with Presto in the cloud. The course covers topics such as Presto overview, use cases, scalable architecture, pluggable connectors, data models, benchmarking, and the Ahana Cloud for Presto managed service. The intended audience for this course includes data platform engineers, data analysts, and anyone interested in data lake analytics.

Syllabus

Intro
What is Presto?
Presto Overview
Presto: One of the Fastest growing presto Open Source Projects in Data Analytics
Common Questions • Is Presto a database?
Presto Use Cases
What makes Presto different?
Scalable Architecture
Pluggable Presto Connectors
Presto Connector Data Model
Presto Hive Connector for Object stores & Files systems
Presto Hive Connector - Data File Types • Supported File Types
Why Presto is Fast?
Presto Benchmarking
Benchto tool from Prestodb
Steps to build the benchto tool
Benchto configuration files
Setting Run environment
Benchmark Run
Performance Metrics
Comparing Results
Ahana Cloud for Presto Managed Service • Enables data platform engineers in minutes vs. days
The Next Data Warehouse: Open Data Lake Analytics

Taught by

Linux Foundation

Reviews

Start your review of An Intro to Presto, the Open Source Distributed SQL Query Engine for the Data Lake

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.