Data Engineering Fundamentals
IBM via edX Professional Certificate
Overview
Data engineering is a growth tech segment, with considerable demand for skilled data engineers. Data engineering makes quality data available for business operations, business intelligence and data-driven decision making.
This six-course Professional Certificate from IBM is an excellent base for those interested in a career in data engineering. Through these data engineering courses, you will learn the core principles and get to practice your new skills with hands-on labs. You will learn about the data engineering ecosystem, data integration pipelines, data repositories, Business Intelligence and Reporting tools. You will understand Data repositories, such as relational and non-relational databases, data warehouses, data marts, data lakes, and big data stores, as well as how to store and process this data.
The certificate starts with an introductory course, then progresses through how Python is used by Data Scientists, in Artificial Intelligence and Development, and gives you the opportunity to create a Python project to put these skills into practice. The certificate then covers relational databases and SQL.
ÂThis Professional Certificate does not require any prior programming or data science skills. These online learning tools will provide you with practical skills and experience in collating data from data sources for factual analysis and providing organizations with the basis for data-driven decision making.
Syllabus
Course 1: Data Engineering Basics for Everyone
Learn about data engineering concepts, ecosystem, and lifecycle. Also learn about the systems, processes, and tools you need as a Data Engineer in order to gather, transform, load, process, query, and manage data so that it can be leveraged by data consumers for operations, and decision-making.
Course 2: Python Basics for Data Science
This Python course provides a beginner-friendly introduction to Python for Data Science. Practice through lab exercises, and you'll be ready to create your first Python scripts on your own!
Course 3: Python for Data Engineering Project
An opportunity to apply your foundational Python skills via a project, using various techniques to collect and work with data
Course 4: Relational Database Basics
This course teaches you the fundamental concepts of relational databases and Relational Database Management Systems (RDBMS).
Course 5: SQL for Data Science
Learn how to use and apply the powerful language of SQL to better communicate and extract data from databases - a must for anyone working in the data science field.
Course 6: SQL Concepts for Data Engineers
In this short course you will learn additional SQL concepts such as views, stored procedures, transactions and joins.
Courses
-
5 weeks long, 2-5 hours a week
View detailsPlease Note: Learners who successfully complete this IBM course can earn a skill badge —a detailed, verifiable and digital credential that profiles the knowledge and skills you’ve acquired in this course. Enroll to learn more, complete the course and claim your badge!
Kickstart your learning of Python for data science, as well as programming in general with this introduction to Python course. This beginner-friendly Python course will quickly take you from zero to programming in Python in a matter of hours and give you a taste of how to start working with data in Python. ~~~~
Upon its completion, you'll be able to write your own Python scripts and perform basic hands-on data analysis using our Jupyter-based lab environment. If you want to learn Python from scratch, this course is for you.
You can start creating your own data science projects and collaborating with other data scientists using IBM Watson Studio. When you sign up, you will receive free access to Watson Studio. Start now and take advantage of this platform and learn the basics of programming, machine learning, and data visualization with this introductory course.
-
4 weeks long, 2-4 hours a week
View detailsMuch of the world's data lives in databases. SQL (or Structured Query Language) is a powerful programming language that is used for communicating with and extracting various data types from databases. A working knowledge of databases and SQL is necessary to advance as a data scientist or a machine learning specialist. The purpose of this course is to introduce relational database concepts and help you learn and apply foundational knowledge of the SQL language. It is also intended to get you started with performing SQL access in a data science environment.
The emphasis in this course is on hands-on, practical learning. As such, you will work with real databases, real data science tools, and real-world datasets. You will create a database instance in the cloud. Through a series of hands-on labs, you will practice building and running SQL queries. You will also learn how to access databases from Jupyter notebooks using SQL and Python.
No prior knowledge of databases, SQL, Python, or programming is required.
Taught by
Joseph Santarcangelo, Ramesh Sannareddy and Rav Ahuja