Cover Image for AI Safety Thursdays: Agentic Misalignment: How LLMs could be insider threats
Cover Image for AI Safety Thursdays: Agentic Misalignment: How LLMs could be insider threats
Avatar for Trajectory Labs
Presented by
Trajectory Labs
1 Going

AI Safety Thursdays: Agentic Misalignment: How LLMs could be insider threats

Registration
Welcome! To join the event, please register below.
About Event

Can AI agents misbehave while carrying out actions autonomously? At this event, Giles Edkins will guide us through a look at and critique some research by Anthropic that demonstrates blackmail and other phenomena when an agent is threatened with shutdown or reprogramming.

​​​Event Schedule
6:00 to 6:45 - Food & Networking
6:45 to 8:00 - Main Presentation & Questions
8:00 9:00 - Discussion

Location
30 Adelaide St E 12th floor
Toronto, ON M5C, Canada
Enter the main lobby of the building and let the security staff know you are here for the AI meetup. You may need to show your RSVP on your phone. You will be directed to the 12th floor where the meetup is held. If you have trouble getting in, give Juliana a call at 647-544-0993.
Avatar for Trajectory Labs
Presented by
Trajectory Labs
1 Going