


Demystifying Lakehouses: From Theory to Practice
Curious about what it really takes to build a lakehouse? Join us for "Demystifying Lakehouses: From Theory to Practice," a technical session led by Akshat Mathur, Product Manager leading Open Data Lakehouses at Cloudera and a contributor to Apache Hive and Iceberg. Drawing from hands-on experience with enterprise-scale data migrations, Akshat will break down the essential components and decision points for designing and running production-grade lakehouse systems.
We’ll dive into topics like storage layer architecture, choosing the right file and table formats, and implementing robust metadata catalogs with snapshot isolation.
You’ll also learn about integrating multiple compute engines (Spark, Flink, Trino, Impala), handling schema evolution, and setting up governance frameworks with audit logging and data lineage. The session also covers performance optimization with hybrid storage tiers and caching.
The session wraps up with a live demo showing how lakehouses overcome the limitations of traditional data lakes and warehouses.
This is a great opportunity for data engineers and architects looking to modernize their data platforms with practical, real-world strategies.
