CAIA Speaker Event: Jerry Wei (Anthropic)

Name: CAIA Speaker Event: Jerry Wei (Anthropic)
Start: 2025-06-03T16:00:00.000-07:00
End: 2025-06-03T17:00:00.000-07:00
Location: Broad Center for the Biological Sciences

Hosted by Adarsh Muthiah Kumarappan

Past Event

About Event

Here are some details on Caltech AI Alignment’s next speaker event:

Who: Jerry Wei (in person), Anthropic
When: June 3rd at 4-5 pm PT
Where: Broad 100
What: Jerry Wei is an AI researcher at Anthropic (formerly Google DeepMind) who works on improving language model capabilities and alignment. His talk will focus on his work on "Constitutional Classifiers": machine learning systems that detect and block "jailbreak" attempts, in which users try to bypass safety training to get harmful outputs. These classifiers prove significantly more robust against manipulation than the language models they protect, withstanding thousands of hours of human jailbreaking attempts.

No specific technical background is required - we welcome all interested students who are eager to learn! As with all CAIA events, we will have pizza and boba!

Location

Broad Center for the Biological Sciences

96, California Institute of Technology, 360 S Wilson Ave, Pasadena, CA 91106, USA

171 Went