[Meetup] AI Agent Evaluation: Techniques for Building Better Agent Systems
As AI agents become increasingly sophisticated, evaluating their performance is essential for driving meaningful improvements and ensuring reliability in real-world applications. Join us for an in-depth exploration of AI agent evaluation, where experts from Google, Arize, LlamaIndex, and others working on the emerging AI agent & assistants tech stack will share advanced techniques for building and optimizing agent systems.
This event will delve into evaluation methodologies, engineering learnings from the trenches, and the latest research focused on the nuanced behaviors of AI agents in dynamic environments.
Through a series of tech talks from Google, Arize, LlamaIndex, Priceline, AutoGen and Weaviate, we’ll explore:
Advanced evaluation techniques for AI agents, focusing on real-world performance metrics.
Key methods for optimizing agent systems to improve decision-making, autonomy, and robustness.
The latest tools and frameworks used in agent development and system evaluation.
Discover best practices for iterating on agent behavior through continuous evaluation and feedback loops.
🥂 Complimentary food and drinks will be provided!
🌟 Fun swag and special giveaways await!
Agenda
5:00pm: Check-in & Networking
6:30pm: Tech Talks
Google Cloud: Stephen Orban, VP, Migrations, ISVs, & Marketplace
Arize AI: Jason Lopatecki, Co-Founder & CEO
AutoGen: Optimize Agentic AI Systems
Chi Wang, Founder
LlamaIndex - Agentic RAG with LlamaIndex Workflows
Luke Chui, Founding Engineer
Weaviate - Building Agentic RAG Systems
Erika Cardenas, Technology Partner Manager
7:30pm: Fireside Chat
Priceline: Rajy Tanneeru, Distinguished Architect
Arize AI: Jason Lopatecki, Co-Founder & CEO
Google
8:00pm - 9:00pm: Networking & Giveaways