Why Do Multi-Agent LLM Systems Fail?

Bilkent Yapay Zeka Topluluğu

Virtual

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Join the Bilkent AI Society’s 11th Seminar Featuring Mert Cemri (UC Berkeley)

Title: Why Do Multi-Agent LLM Systems Fail? A Deep Dive into MAST and the Future of AI Agents

We are thrilled to host Mert Cemri, PhD researcher at UC Berkeley and co-author of the influential paper “Why Do Multi-Agent LLM Systems Fail?”, for a thought-provoking session on the hidden challenges of building large language model (LLM)-based multi-agent systems.

Despite the growing hype around agent-based AI, recent evidence shows that these systems often fail more than they succeed. Why? In this talk, Mert will unveil MAST (Multi-Agent System Failure Taxonomy), the first empirically grounded framework to systematically understand why multi-agent systems underperform.

Learn how over 200 real-world MAS traces were analyzed to extract 14 failure types, categorized under specification issues, inter-agent misalignment, and verification breakdowns. Discover how even top-tier systems like ChatDev and MetaGPT struggle with subtle coordination bugs, overlooked specifications, and flawed verification processes.

The talk will also feature insights from the development of an LLM-as-a-judge evaluation pipeline, and how MAS failures relate more to organizational design than model capabilities, a critical shift in thinking.

🔗 Who Should Attend:

AI and LLM researchers interested in multi-agent architectures
Engineers and builders working on AI agents and orchestration
Researchers focused on evaluation, trustworthiness, and AI failure analysis
Anyone exploring the future of robust and scalable AI systems

📅 Date: June 19, 2025

🕔 Time: 19:00 TRT (GMT+3)

📍 Location: Online (Register to get the meeting link)

🌐 Register now for free: https://register.bilkentai.com

This session promises to redefine how we think about AI agents, from architecture to accountability. Don’t miss this chance to engage with cutting-edge research shaping the future of intelligent systems.

Presented by

Bilkent Yapay Zeka Topluluğu

Hosted By

88 Went

AI