Presented by
Trajectory Labs

AI Safety Thursdays: Understanding The Self-Other Overlap Approach

About Event

Description

Leo Zovic presents on a less-explored technique that optimizes models to maintain similar internal representations when reasoning about themselves and others.

This scalable approach not only reduces deceptive behavior in AI systems but can also perfectly classify deceptive agents based on their self-other overlap values.

Event Schedule

6:00 to 6:45 - Networking and refreshments
6:45 to 8:00 - Main Presentation
8:00 to 9:00 - Breakout Discussions

Location
30 Adelaide St E
Toronto, ON M5C 3G8, Canada
Enter the main lobby of the building and let the security staff know you are here for the AI meetup. You may need to show your RSVP on your phone. You will be directed to the 12th floor where the meetup is held. If you have trouble getting in, give Smitty a call at 647-424-4111.