Share your views in the comments!
To make this clear and easy to follow, please use these guidelines:
- Use the template below.
- Post as many items as you want.
- One item per comment, so that it's easy for people to read and react.
- (Optional, encouraged) Highlight at least one of your own contributions.
If you need some inspiration, open your EA Forum Wrapped and scroll to the bottom of your "Strong Upvoted" list.
Template
Title:
Author:
URL:
Why it's good:
If you're sharing an underrated comment, set the title to "[Username] on [topic]".
Title: Paul Christiano on how you might get consequentialist behavior from large language models
Author: Paul Christiano
URL: https://forum.effectivealtruism.org/posts/dgk2eLf8DLxEG6msd/how-would-a-language-model-become-goal-directed?commentId=cbJDeSPtbyy2XNr8E
Why it's good: I think lots of people are very wrong about how LLMs might lead to consequentialist behavior, and Paul's comment here is my favorite attempt at answering this question. I think that this question is extremely important.