Cover Image for CAIA Speaker Event: Jerry Wei (Anthropic)
Cover Image for CAIA Speaker Event: Jerry Wei (Anthropic)
171 Went

CAIA Speaker Event: Jerry Wei (Anthropic)

Hosted by Adarsh Muthiah Kumarappan
Registration
Past Event
Welcome! To join the event, please register below.
About Event

Here are some details on Caltech AI Alignment’s next speaker event:

  1. Who: Jerry Wei (in person), Anthropic

  2. When: June 3rd at 4-5 pm PT

  3. Where: Broad 100

  4. What: Jerry Wei is an AI researcher at Anthropic (formerly Google DeepMind) who works on improving language model capabilities and alignment. His talk will focus on his work on "Constitutional Classifiers" - machine learning systems that detect and block "jailbreak" attempts where users try to bypass safety training to get harmful outputs. These classifiers prove significantly more robust against manipulation than the language models they protect, withstanding thousands of hours of human jailbreaking attempts.

No specific technical background is required - we welcome all interested students who are eager to learn! As with all CAIA events, we will have pizza and boba!

Location
Broad Center for the Biological Sciences
96, California Institute of Technology, 360 S Wilson Ave, Pasadena, CA 91106, USA
171 Went