

DeployCon: Live Stream
Serving a model isn’t the finish line—it’s the starting gun.
Welcome to DeployCon, a free, no-fluff, engineer-first summit for teams building the next generation of AI. We’re bringing together top minds in LLMOps to share how they deploy, scale, and evolve AI systems in production.
This is a live event that will be streamed online. If you'd like to attend in person in San Francisco, register here.
🎙️ Speakers and Talks
AI leaders from Pinterest, DoorDash, Tinder, Nvidia, ConverseNow, and AWS will lead the following talks and panels:
Turbo-Scaling GenAI at DoorDash: From Billions of SKUs to Real-Time Personalization
Creating Deeper and Safer Human Connections with GenAI
Productionizing Prompts: How Pinterest Turned Every Team into GenAI Power Users
The Future of Production AI: Moving from POC to Production AI that Thinks for Itself and Continuously Improves
Evolving World of Open-source and Applied AI: New Model Architectures, Agentic Orchestration and Inference Optimizations
🚀 Why Attend?
Forward-looking, Battle-tested LLMOps Strategies
Hear how AI leaders and teams deploy, autoscale, and monitor large language models and AI agents under real-world traffic.
Post-training in the Wild
Learn how top teams use live feedback loops, reward-based systems, and fine-tuning to make small open-source models reason like experts.
Latency, Cost, and Scale Tactics
Get the playbooks for squeezing microseconds out of existing GPUs and maximizing throughput with the latest techniques from research.
Stories from the Front Lines
Through real-world case studies, AI practitioners share what breaks, what bends, and what wins when GenAI meets production.
📅 Agenda
10:30: Lightning Talks and Panels
1:30: Event Concludes
By registering for this event, I understand Predibase will process my information in accordance with their Privacy Policy.