Generating a preference dataset for DPO/ORPO and cleaning it with AI feedback
Past Event
About Event
Join us for an in-depth session on distilabel, the framework for synthetic data and AI feedback!
In this session, we'll walk you through the essentials of building a distilabel pipeline by exploring two key use cases: cleaning an existing dataset and generating a preference dataset for DPO/ORPO. You'll also learn how to integrate Argilla to gather human feedback and improve the quality of the resulting dataset.
This session is perfect for you:
if you're getting started with distilabel or synthetic data,
if you want to discover new functionalities,
if you want to share feedback with us.