Cover Image for CAIA Speaker Series: Alex Turner
Presented by
Speaker Series
About Event
  1. Who: Alex Turner, Research scientist at Google DeepMind

  2. When: April 4th, 5-6 pm PT

  3. Where: ANB 121

  4. What: Your AI’s training data might make it more “evil” and better able to circumvent your security, monitoring, and control measures. Evidence suggests that when you pretrain a powerful model to predict a blog post about how powerful models will probably have bad goals, the model becomes more likely to adopt bad goals. I discuss ways to test for and mitigate these potential mechanisms. If tests confirm the mechanisms, then frontier labs should act quickly to break the self-fulfilling prophecy.

No specific technical background is required; we welcome all interested students who are eager to learn! As with all CAIA events, we will have pizza and boba!

Location
Annenberg Center for Information Science and Technology
330 S Chester Ave, Pasadena, CA 91125, USA