Details
Tonight's Topic: Can AI be truly honest? Representation Engineering: A Top-Down Approach to AI Transparency is a paper released from October that delves into this question. It examines innovative research aimed at enhancing AI's truthfulness. Join us this week in discussing methods to detect and manipulate AI honesty and the broader implications for AI safety.
About the Toronto AI Safety Meetup: Expect debate, presentations, some technical content and the occasional practical (programming) workshops. Each week we'll let you know what to expect as the different events will each have a different flavour.
We welcome a variety of backgrounds, opinions, and experience levels.
Getting here: Enter the lobby at 100 University Ave (right next to St Andrew subway station), and message Giles Edkins on the meetup app or call him on 647-823-4865 to be let up to room 6H.