

CAIA Speaker Series: Alex Turner
Who: Alex Turner, research scientist at Google DeepMind
When: April 4th, 5-6 pm PT
Where: ANB 121
What: Your AI’s training data might make it more “evil” and more able to circumvent your security, monitoring, and control measures. Evidence suggests that when you pretrain a powerful model to predict a blog post arguing that powerful models will probably have bad goals, the model becomes more likely to adopt bad goals. I will discuss ways to test for and mitigate these potential mechanisms. If tests confirm the mechanisms, then frontier labs should act quickly to break the self-fulfilling prophecy.
No specific technical background is required; we welcome all interested students who are eager to learn! As with all CAIA events, we will have pizza and boba!