

LLM-as-a-Judge Evals: Comparing Kimi, Qwen, and GLM
Join us for a practical walkthrough of how LLM evaluators, also known as “LLM-as-a-Judge”, are transforming model evaluation. We’ll also explore key use cases and prompting techniques, and see how this approach can be used to choose among the newest state-of-the-art open models from Kimi, Qwen, and GLM!🔥
What You'll Learn
This session is designed to help you scale evaluation workflows, improve consistency in model assessment, and reduce reliance on manual human annotation.
Why Attend?
Applied Guidance: Get a condensed, practical overview of LLM evaluation best practices.
Research-Backed: Insights drawn from cutting-edge papers and current community tools.
Live Q&A: Ask questions and get answers from the Together AI team.
This online talk will cover:
Why LLM-as-a-Judge is emerging now
Direct scoring vs. pairwise comparison (see the brief sketch after this list)
Prompting strategies for consistent judgments
Practical use cases in the wild
Critiques, limitations, and safeguards
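
To give a flavor of the "direct scoring vs. pairwise comparison" topic, here is a minimal sketch of the two judging modes, assuming an OpenAI-compatible chat-completions endpoint such as Together AI's. The judge model name, rubric wording, and environment variable below are illustrative placeholders, not prescribed choices from the talk.

```python
import os
from openai import OpenAI

# Placeholder endpoint and credentials; any OpenAI-compatible provider works here.
client = OpenAI(
    base_url="https://api.together.xyz/v1",
    api_key=os.environ["TOGETHER_API_KEY"],
)

JUDGE_MODEL = "example-judge-model"  # placeholder; swap in your judge of choice


def direct_score(question: str, answer: str) -> str:
    """Direct scoring: the judge rates a single answer against a rubric."""
    prompt = (
        "You are an impartial evaluator. Rate the answer to the question below "
        "on a 1-5 scale for correctness and helpfulness. Reply with a brief "
        "justification followed by 'Score: <n>'.\n\n"
        f"Question: {question}\nAnswer: {answer}"
    )
    resp = client.chat.completions.create(
        model=JUDGE_MODEL,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic judgments help consistency
    )
    return resp.choices[0].message.content


def pairwise_compare(question: str, answer_a: str, answer_b: str) -> str:
    """Pairwise comparison: the judge picks the better of two candidate answers."""
    prompt = (
        "You are an impartial evaluator. Compare the two answers to the question "
        "below and reply with 'A', 'B', or 'Tie', followed by a one-sentence reason. "
        "Ignore answer order and length.\n\n"
        f"Question: {question}\nAnswer A: {answer_a}\nAnswer B: {answer_b}"
    )
    resp = client.chat.completions.create(
        model=JUDGE_MODEL,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return resp.choices[0].message.content
```

Direct scoring is convenient for absolute quality tracking over time, while pairwise comparison tends to be more reliable when the goal is simply to pick the stronger of two models or prompts.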
We look forward to seeing you there!