Presented by

Unleashing the world's untapped human potential to accelerate AGI. Solving the human intelligence bottleneck with genAI products and solutions.

Scalable Evaluation of Large Language Models: A UPenn Lecture | Sponsored by Turing

Turing Events

Zoom

Past Event

About Event

You are invited to join a special virtual stream of Prof. Mayur Naik’s CIS 7000 course on Large Language Models at the University of Pennsylvania, sponsored by Turing.

This session covers scalable approaches to evaluating large language models (LLMs) such as ChatGPT by using other LLMs as evaluators. Yann Dubois will discuss the unique challenges of evaluating open-ended LLM outputs, the potential benefits of using one model to assess another, and strategies for overcoming common limitations.

Speaker Bio: Yann Dubois is a final-year Ph.D. student in computer science at Stanford University, advised by Percy Liang and Tatsu Hashimoto. His research focuses on optimizing AI performance when resources are scarce, with notable contributions to the Alpaca project, which aims to improve LLM training and evaluation efficiency.

This virtual stream opens the classroom experience to an audience beyond UPenn students. Anyone interested in AI, machine learning, or the techniques behind large language models is welcome to attend.

Learn more about Prof. Mayur Naik’s course here.