Cover Image for Demystifying Lakehouses- From Theory to Practice

Presented by

We organise community events and webinars surrounding Data enginnering topics like CDC, Apache Iceberg, ETL from Database to Data Lakehouses

Hosted By

35 Went

Demystifying Lakehouses- From Theory to Practice

OLake Community Events

Virtual

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Curious about what it really takes to build a lakehouse? Join us for "Demystifying Lakehouses—From Theory to Practice," a technical session led by Akshat Mathur, Product Manager leading Open Data Lakehouses at Cloudera and Apache Hive, Iceberg contributor. Drawing from hands-on experience with enterprise-scale data migrations, Akshat will break down the essential components and decision points for designing and running production-grade lakehouse systems.

We’ll dive into topics like storage layer architecture, choosing the right file and table formats, and implementing robust metadata catalogs with snapshot isolation.

You’ll also learn about integrating multiple compute engines (Spark, Flink, Trino, Impala), handling schema evolution, and setting up governance frameworks with audit logging and data lineage. Plus, discover performance optimization tips using hybrid storage tiers and caching.

The session wraps up with a live demo showing how lakehouses overcome the limitations of traditional data lakes and warehouses.

This is a great opportunity for data engineers and architects looking to modernize their data platforms with practical, real-world strategies.

Presented by

OLake Community Events

We organise community events and webinars surrounding Data enginnering topics like CDC, Apache Iceberg, ETL from Database to Data Lakehouses

Hosted By

35 Went