
Webinar: How to use LLM-as-a-judge to evaluate LLM systems

Hosted by Evidently AI & Elena Samuylova
Zoom
About Event

How do you evaluate the quality of an LLM-powered system, like a chatbot or AI agent? 

Judging the quality of open-ended text outputs is tricky. Traditional machine learning metrics like accuracy and rule-based checks simply don’t work well for free-form text. On the other hand, having human evaluators score every LLM response is costly and does not scale. One way to address this is to use another LLM to evaluate the outputs of your AI system. This approach is called "LLM-as-a-judge."

So what exactly is an LLM judge, and how do you use it to evaluate generative AI applications? Join our webinar to learn how to create, tune, and evaluate LLM judges.

What we will cover:

  • What are LLM evaluations and when do you need them?

  • Different types of LLM evaluations: offline and online evals.

  • What is "LLM-as-a-judge"?

  • How do you create an LLM judge in 5 simple steps?

  • What makes an effective evaluation prompt?

  • How do you apply an LLM judge and evaluate its performance?

About the speaker:
Elena Samuylova is the CEO and Co-founder of Evidently AI, the company behind Evidently, an open-source framework for ML and LLM evaluation and observability with over 20 million downloads.

She has been active in the applied ML space for over 10 years. Previously, she co-founded an industrial AI startup and served as its CPO, implementing machine learning for production optimization at global metal and chemical companies. Prior to that, she led business development at Yandex Data Factory, an enterprise AI division of Yandex, where she focused on delivering ML-based solutions to retail, banking, telecom, and other industries. In 2018, Elena was named one of the 50 Women in Product Europe by Product Management Festival.