

Real-Time Data Lakes featuring ClickHouse® and StarRocks
Real-Time Data Lakes featuring ClickHouse® and StarRocks
Real-time databases are integrating with data lakes to reduce storage costs and share data with AI and data science. Please join us to hear from a range of experts—including longtime ClickHouse and StarRocks practitioners—as they share current problems and solutions while navigating the transition from closed storage models to open table formats like Apache Iceberg. There will be plenty of time to network: discuss problems and brainstorm solutions with your peers.
Presenters
Robert Hodges, CEO @ Altinity
James Greenhill, Chief Data Wrangler @ PostHog
Sida Shen, Product Manager @ CelerData
Description of the talks:
Adapting ClickHouse to use Apache Iceberg Storage - Robert Hodges, CEO @ Altinity.
Covers Altinity's Project Antalya, which is adapting open source ClickHouse to introduce separation of compute and storage using Iceberg tables as. Architecture, performance results, and roadmap are included.
The PostHog Data Lakehouse - How we turned ClickHouse into our Lake House - James Greenhill, Chief Data Wrangler @ PostHog
How we built a Data Warehouse product on top of ClickHouse without breaking the bank using S3, Parquet, ArrowStream, and more!
Achieving Data Warehouse Performance on Apache Iceberg - Sida Shen, Product Manager, CelerData
This talk dives into technical optimizations that deliver low-latency, high-concurrency queries on Apache Iceberg without sacrificing openness. Together, we'll examine what kills performance when querying Iceberg, highlight best practices that make queries faster, and evaluate query engine optimizations for Iceberg—including handling position and equality delete tables, distributed metadata parsing, and more. You'll hear real-world stories from leading enterprises who have used these lessons to optimize Apache Iceberg performance at scale and walk away with actionable techniques for making your Iceberg lakehouse faster than ever.