Pluralis Reading Group - DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Hosted by Sameera Ramasighe
Registration
Past Event
About Event
This week's paper: DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Link: https://arxiv.org/pdf/2503.14476