Presented by

Catalyzing Toronto's role in steering AI progress toward a future of human flourishing. Join us for a variety of events on technical AI safety, governance in a world of advanced AI, and more.

39 Went

AI Safety Thursdays: Understanding The Self-Other Overlap Approach

Name: AI Safety Thursdays: Understanding The Self-Other Overlap Approach
Start: 2025-05-22T18:00:00.000-04:00
End: 2025-05-22T21:00:00.000-04:00
Location: 30 Adelaide St E

Trajectory Labs

30 Adelaide St E

Toronto, Ontario

Past Event

About Event

Leo Zovic presents self-other overlap, a less-explored technique that optimizes models to maintain similar internal representations when reasoning about themselves and when reasoning about others.

This scalable approach not only reduces deceptive behavior in AI systems but can also perfectly classify deceptive agents based on their self-other overlap values.

Event Schedule

6:00 to 6:45 - Networking and refreshments
6:45 to 8:00 - Main Presentation
8:00 to 9:00 - Breakout Discussions
