OLake Community Events
We organise community events and webinars on data engineering topics such as CDC, Apache Iceberg, and ETL from databases to data lakehouses.
35 Went

Demystifying Lakehouses: From Theory to Practice

Virtual
Past Event
About Event

Curious about what it really takes to build a lakehouse? Join us for "Demystifying Lakehouses: From Theory to Practice," a technical session led by Akshat Mathur, Product Manager leading Open Data Lakehouses at Cloudera and a contributor to Apache Hive and Apache Iceberg. Drawing on hands-on experience with enterprise-scale data migrations, Akshat will break down the essential components and decision points for designing and running production-grade lakehouse systems.

We’ll dive into topics like storage layer architecture, choosing the right file and table formats, and implementing robust metadata catalogs with snapshot isolation.

You’ll also learn about integrating multiple compute engines (Spark, Flink, Trino, Impala), handling schema evolution, and setting up governance frameworks with audit logging and data lineage. Plus, discover performance optimization tips using hybrid storage tiers and caching.

The session wraps up with a live demo showing how lakehouses overcome the limitations of traditional data lakes and warehouses.

This is a great opportunity for data engineers and architects looking to modernize their data platforms with practical, real-world strategies.
