Generative AI-focused workshops, hackathons, and more. Come build with us!

Arize AI

Step into the world of LLM evaluations with a three-part series dedicated to achieving production excellence. We’ll unpack advanced evaluation techniques and best practices, formulated through rigorous testing across retrieval, summarization, and hallucination, to help ensure production readiness. A must-attend for AI and ML engineers and data scientists.

Session 1 (10/3): Benchmarking and Analyzing Retrieval Approaches

Session 2 (10/10): Statistical Analysis of Summarization LLM Evaluations

Session 3 (10/16): Statistical Analysis of Hallucination LLM Evaluations

LLM Evaluation Essentials

Dominik Scherm

otto