edX

Data Engineering Foundations

Pragmatic AI Labs via edX Professional Certificate

Overview

This program is designed for aspiring data engineers, software developers, and data professionals. Through hands-on learning experiences, you'll master data manipulation, pipeline building, and task automation using Python, Rust, Bash, and SQL, along with tools like Pandas, FastAPI, and Kubernetes.

Develop your proficiency in critical areas such as Linux and Bash scripting, data manipulation with Python and SQL, web application development, containerization, and data pipeline optimization. Gain hands-on experience with industry-standard platforms like Hadoop, Spark, and Snowflake, while leveraging powerful tools like Databricks and MLflow to execute data analytics and manage machine learning workflows. Our renowned faculty and industry connections ensure you receive a world-class education, keeping pace with the latest trends and best practices.

By completing this program, you'll acquire a versatile skill set that empowers you to tackle real-world data challenges, automate workflows, and drive data-driven decision-making. Whether you're seeking to advance your current career or transition into the exciting field of data engineering, this program provides you with the knowledge, tools, and hands-on experience necessary to thrive in today's data-centric world.

Syllabus

Courses under this program:
Course 1: Linux and Bash for Data Engineering

Master Linux and Bash essentials for data engineering. Learn to manipulate data, build pipelines, and automate tasks using shell scripting and powerful Linux tools.



Course 2: Scripting with Python and SQL for Data Engineering

Master data manipulation using Python and SQL for data engineering. Extract, transform, and load data efficiently to drive insights and decision-making.



Course 3: Python and Pandas for Data Engineering

Master Python essentials and Pandas for data engineering. Learn to set up development environments, manipulate data, and efficiently solve real-world problems.



Course 4: Web Applications and Command-Line Tools for Data Engineering

Learn to build web apps, microservices, and command-line tools for efficient data engineering using Python, FastAPI, and Rust.



Course 5: Virtualization, Docker, and Kubernetes for Data Engineering

Master virtualization, Docker, and Kubernetes for data engineering. Gain hands-on experience with cloud development environments, container orchestration, and production best practices.



Course 6: Spark, Hadoop, and Snowflake for Data Engineering

Gain the skills for building efficient and scalable data pipelines. Explore essential data engineering platforms (Hadoop, Spark, and Snowflake) and learn how to optimize them using Python, PySpark, and MLflow.



Course 7: Advanced Data Engineering

Become an expert in scaling data systems. Master Celery, Airflow, and graph databases. Build real-world solutions for massive datasets and complex workflows. Optimize performance at enterprise scale.

10 hands-on labs and exercises to "learn by doing".



Course 8: Rust Data Engineering

Use Rust's power for high-performance, concurrent data pipelines - from core principles to real-world deployments.



Taught by

Noah Gift, Alfredo Deza and Kennedy Behrman
