Overview
Learn how to implement Change Data Capture (CDC) in data lakehouses using Apache Hudi in this 46-minute technical presentation. Explore the fundamental concepts of CDC as a technique for tracking and capturing data modifications while maintaining data freshness and consistency across systems. Discover how combining CDC with data lakehouses addresses common ETL pipeline challenges when moving data from transactional to analytical databases. Master the integration between data lakes and CDC, understand their combined benefits, explore various implementation approaches, and gain insights into key technologies and tools. Follow along as the presenter Sagar shares best practices and criteria for selecting appropriate tools to meet specific data management needs.
Syllabus
Ep 6: Change Data Capture (CDC) in Lakehouse with Apache Hudi
Taught by
Apache Hudi