

Apache Iceberg™ Bay Area Meetup: Oct Edition
🔹 Apache Iceberg™ Bay Area: Oct Edition
📍 Contemporary Jewish Museum, San Francisco
🗓️ October 1, 2025 | ⏰ Half-day, afternoon event
💥 Free to attend – limited spots available
The Oct Edition is here. We’re taking Apache Iceberg™ to the next level with multi-track talks, real-world production stories, deep technical dives, and a packed room of data builders, operators, and open source contributors.
Whether you're scaling your Iceberg deployment, exploring new integrations, or just curious what all the buzz is about, this is the must-attend Iceberg event of the fall.
Expect:
🧠 Multiple tracks of technical talks
💬 Networking with top engineers & practitioners
🍴 Delicious food & drinks
🤝 A fun, high-energy community atmosphere
📢 Call for Proposals is Open!
👉 Submit your talk here: https://sessionize.com/iceberg-meetup-bay-area/
We’re now accepting talk submissions.
🔍 What we’re looking for:
🧊 Tip of the Iceberg – Technical Internals & Architecture: Deep dives into how Iceberg works, performance engineering, file formats, metadata layers, and internals. Perfect for advanced users and contributors.
❄️ Breaking the Ice – Real-World Use Cases & War Stories: Tell us what worked and what didn’t. We want to hear about your production rollouts, scaling challenges, and lessons learned along the way.
🥃 On the Rocks – Integrations, Tools & Ecosystem: How does Iceberg fit into your stack? Talk about streaming, data lakes, compute engines, catalog systems, or open-source tooling you’ve built or used.
🌐 Future Ice Age – Roadmap, Innovation & Emerging Trends: Got something new on the horizon? This is for forward-looking talks, prototypes, and ideas shaping the next phase of Iceberg and the lakehouse ecosystem.
We also welcome anything else that helps others learn, build, or deploy Iceberg better.
Talks will be 20-25 minutes long plus 5-10 minutes of Q&A (30 minutes total). This is a 4-track event, so we’re welcoming a range of technical depths and perspectives.
🗓️ CFP Deadline: Friday, September 5, 2025 at 11:59 PM PT
We can’t wait to see what you’ve been working on.
Apache Iceberg™, Iceberg, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse or review the materials provided at this event, which is managed by BigCo.
Agenda
12:00 PM - 1:00 PM: Doors Open, Lunch & Networking
1:00 PM - 1:45 PM: Welcome Remarks & Keynote
1:45 PM - 2:00 PM: Break
2:00 PM - 3:30 PM: Presentations (2-4 tracks)
3:30 PM - 4:00 PM: Snack Break
4:00 PM - 6:00 PM: Presentations (2-4 tracks)
6:00 PM - 7:30 PM: Dinner & Networking
About Databricks
Databricks is the data and AI company. More than 10,000 organizations worldwide — including Block, Comcast, Condé Nast, Rivian, Shell and over 60% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to take control of their data and put it to work with AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse and Apache Spark™.
📚 Read how Tabular has joined Databricks toward a joint vision of the open lakehouse.
📲 Follow Databricks on LinkedIn, X, and Facebook.
🖥️ Subscribe to Databricks YouTube.
💾 Sign up for Databricks Express Setup and get $400 free credits when using your work email.
About PuppyGraph
PuppyGraph is the first and only real-time, zero-ETL graph query engine on the market, empowering companies to transform existing relational data stores into a unified graph model in under 10 minutes, bypassing traditional graph databases' cost, latency, and maintenance hurdles.
💬 Join PuppyGraph Community Slack
📚 Check out PuppyGraph Engineering Blog
📲 Follow PuppyGraph on LinkedIn & Twitter
🖥️ Subscribe to PuppyGraph YouTube
💾 Download PuppyGraph Forever Free Developer Edition (no form & no payment required)
About AWS
Whether you're looking for generative AI, compute power, database storage, content delivery, or other functionality, AWS has the services to help you build sophisticated applications with increased flexibility, scalability, and reliability. AWS is the world's largest cloud services provider. https://aws.amazon.com/
Apache Iceberg is an open-source table format that simplifies table management while improving performance. AWS analytics services such as Amazon SageMaker Lakehouse, Amazon S3 Tables, Amazon EMR, AWS Glue, Amazon Athena, and Amazon Redshift include native support for Apache Iceberg, so you can easily build transactional data lakes on top of Amazon Simple Storage Service (Amazon S3).
Additional Resources and Information:
📚 Workshop: Running Apache Iceberg on AWS
📚 Blogs: Apache Iceberg on AWS
📚 AWS Prescriptive Guidance: Using Apache Iceberg on AWS
🖥️ Subscribe to AWS Events and AWS Developers
💜 We’re hiring, join our team
About CelerData
CelerData (powered by StarRocks) is the fastest query engine for customer-facing and AI-driven analytics at petabyte scale. With native Apache Iceberg integration, it delivers low-latency, high-concurrency queries directly on open data—without ingestion delays or costly pipelines.
Trusted by industry leaders like Pinterest, Tencent, and Expedia, CelerData powers the next generation of analytics on the Lakehouse.
💬 Join the StarRocks Slack Channel
🖥️ Subscribe to CelerData's YouTube channel
📚 Follow CelerData on LinkedIn
☁️ Try CelerData Cloud and claim your 30-day free trial
💜 We’re hiring, join our team!
About Dremio
Dremio is the intelligent lakehouse platform that accelerates AI and analytics with AI-ready data products, unified access, and automated performance optimization. Built on Apache Iceberg, Arrow, and Polaris, Dremio combines a business-friendly semantic layer, a high-speed SQL engine, and an enterprise-grade catalog to deliver fast, governed, and discoverable data across cloud and on-prem environments.
📚 Learn more about Dremio
🖥️ Get Started with Dremio for Free
About Wherobots
Wherobots is the Spatial Intelligence Cloud that unlocks planetary-scale answers from geospatial data. It enables high-performance geospatial ETL, analytics, and AI at planetary scale with a modern data lakehouse architecture. Developed by the original creators of Apache Sedona, Wherobots empowers data teams to utilize spatial data up to 20x faster at a fraction of the cost of alternative cloud services when used for geospatial analytics and computer vision.
📚 Follow Wherobots on LinkedIn and Twitter
🖥️ Check out Wherobots Blog
📲 Subscribe to Wherobots' YouTube Channel
💜 Try Wherobots for Free
💬 Join the Apache Sedona™ community
About Daft
Daft is a distributed query engine providing simple and reliable data processing for any modality and scale. Process petabytes of multimodal data with declarative queries that just work, turning months of infrastructure struggle into days of breakthrough application development. Daft will enable you to build AI systems that were previously impossible, powered by infrastructure that embraces the inherent messiness of real-world data rather than fighting it.
💬 Join Distributed Data Community Slack
📚 Check out Daft Engineering Blog
📲 Follow Daft on LinkedIn & Twitter
💜 We’re hiring, join our team
About ClickHouse
Established in 2009, ClickHouse leads the industry with its open-source column-oriented database system, driven by the vision of becoming the fastest OLAP database globally. The company empowers users to generate real-time analytical reports through SQL queries, emphasizing speed in managing escalating data volumes.
📚 Get started on ClickHouse Github
💬 Join the ClickHouse Slack Channel
💜 We’re hiring, join our team!
About MinIO
MinIO is the company behind AIStor, the world’s most widely adopted exascale object store for enterprise AI data, agentic computing, and analytics. Trusted by 77% of the Fortune 100 and built for performance at scale, AIStor unifies structured and unstructured data in a single, consistent system. It's object-native, hybrid by design, and fully S3-compatible. Run it anywhere: from edge to core to cloud.
Whether you're training massive models, deploying AI agents, or scaling your data lakehouse, AIStor delivers the speed, control, and scale your workloads demand.
📲 Try MinIO AIStor
📚 Check Out The Definitive Guide to Lakehouse Architecture with Iceberg and AIStor
💬 Follow MinIO on LinkedIn & X
About Buf
Buf is driving the shift towards universal schema adoption across your entire stack — from your network APIs, to your streaming data, to your data lake. Our enterprise-grade Kafka and gRPC services include the Buf Schema Registry, the Bufstream streaming data platform and a host of open source projects that make Protobuf work for everyone.
Get started today:
📲 Buf Schema Registry Quickstart
🖥️ Bufstream Quickstart
📚 Buf GitHub
About RisingWave
RisingWave Labs, founded in 2021 in San Francisco, develops RisingWave, a cloud-native SQL streaming database that simplifies real-time data processing. The company’s technology combines PostgreSQL compatibility with modern streaming architecture, offered both as an open-source solution and as RisingWave Cloud, a fully managed platform.
📚 Visit RisingWave website
☁️ Try RisingWave Cloud
💬 Join RisingWave Slack Community
📲 Follow RisingWave on LinkedIn and Twitter
🖥️ Subscribe to RisingWave YouTube channel