Unlock the potential of data analytics with this comprehensive course on implementing a lakehouse using Microsoft Fabric. Participants will explore how Microsoft Fabric integrates data lake flexibility with data warehouse analytics, providing a robust solution for end-to-end enterprise analytics. Through hands-on exercises and detailed lessons, you'll learn to leverage Microsoft Fabric for creating, managing, and optimizing data lakehouses, enhancing your analytics capabilities.
This course delves into key technologies like Apache Spark and Delta Lake, guiding you through their implementation and usage within Microsoft Fabric. By the end of the course, you'll be proficient in data ingestion, transformation, and orchestration using advanced tools and techniques, positioning yourself to drive impactful data-driven decisions within your organization.
Audience Profile
This course is ideal for data professionals, data engineers, and analytics practitioners looking to enhance their skills in modern data architectures. It is also beneficial for IT managers and business analysts seeking to implement robust data solutions using Microsoft Fabric.
Prerequisites
You should be familiar with basic data concepts and terminology.
Course Outline
Module 1: Introduction to end-to-end analytics using Microsoft Fabric
- Explore end-to-end analytics with Microsoft Fabric
- Data teams and Microsoft Fabric
- Enable and use Microsoft Fabric
Module 2: Get started with lakehouses in Microsoft Fabric
- Explore the Microsoft Fabric Lakehouse
- Work with Microsoft Fabric Lakehouses
- Explore and transform data in a lakehouse
- Exercise - Create and ingest data with a Microsoft Fabric Lakehouse
Module 3: Use Apache Spark in Microsoft Fabric
- Prepare to use Apache Spark
- Run Spark code
- Work with data in a Spark dataframe
- Work with data using Spark SQL
- Visualize data in a Spark notebook
- Exercise - Analyze data with Apache Spark
Module 4: Work with Delta Lake tables in Microsoft Fabric
- Understand Delta Lake
- Create delta tables
- Work with delta tables in Spark
- Use delta tables with streaming data
- Exercise - Use delta tables in Apache Spark
Module 5: Ingest Data with Dataflows Gen2 in Microsoft Fabric
- Understand Dataflows Gen2 in Microsoft Fabric
- Explore Dataflows Gen2 in Microsoft Fabric
- Integrate Dataflows Gen2 and Pipelines in Microsoft Fabric
- Exercise - Create and use a Dataflow Gen2 in Microsoft Fabric
Module 6: Use Data Factory pipelines in Microsoft Fabric
- Understand pipelines
- Use the Copy Data activity
- Use pipeline templates
- Run and monitor pipelines
- Exercise - Ingest data with a pipeline
Module 7: Organize a Fabric lakehouse using medallion architecture design
- Describe medallion architecture
- Implement a medallion architecture in Fabric
- Query and report on data in your Fabric lakehouse
- Considerations for managing your lakehouse
- Exercise - Organize your Fabric lakehouse using a medallion architecture