Recruit the World’s best for AGI Alignment

Greg_Colbourn ⏸️

Recruit the World’s best for AGI Alignment

Greg_Colbourn ⏸️

26 min readMar 30, 2023

Comments 8

Sorted by

New & upvoted

anonymous6

Right now it seems like AI safety is more of a scene (centered around the Bay Area) than a research community. If you want to attract great scientists and mathematicians (or even mediocre scientists and mathematicians), something even more basic than coming up with good "nerd-sniping" problems is changing this. There are many people who could do good work in technical AI safety, but would not fit in socially with EAs and rationalists.

I'm sympathetic to arguments that formal prepublication peer review is a waste of time. However, I think formalizing and writing up ideas to academic standards, such that they could be submitted for peer review, is definitely a very good use of time, it's not even that hard, and there should be more of it. This would be one step towards making a more bland, boring, professionalized research community where a wider variety of people might want to be involved.

Greg_Colbourn ⏸️

Let's hope that AGI Alignment goes mainstream in the coming months. I think it has a good chance with the Pause letter, and the wide reporting it has had in the mainstream media.

Daniel_Friedrich

If Big Tech finds these kinds of salaries cost-effective to solve their problems, I would consider it a strong argument in favor of this project.
I imagine Elon Musk could like this project given that he believes in small effective teams of geniuses.
I'd say "polymaths" is a good label for people I'd expect to make progress like Yudkowsky, Bostrom, Hanson and von Neumann.
1. Edit: This may be fame-selection (engineers don't often get credit, particularly in teams) or self-selection (interest in math+society).
The Manhattan and Enigma projects seem like examples where this kind of strategy just worked out. Some consideration that come to mind:
1. There could be selection effects.
2. From what I can find, members of these teams weren't lured in by a lot of money. However, the salience of the AI threat in society is tiny, compared to that of WWII and large incentives could compensate that.
3. I've read money can sometimes decrease intrinsic motivation, that drives exploration & inventions, however these findings are being rebutted by newer studies. Apart from that, my guess would be that getting those teams together is the key part and if large money can facilitate that, great.
A wild idea that might help in case a similar phenomenon works in the sub-population of geniuses & which could make this project more appealing to donors: Limit a portion of these salaries, so that the recipients could only use them for socially beneficial uses.

Chris バルス

Thanks for sharing the idea. Question: you've written that there wasn't sufficient interest, and I assume that includes OP, SFF, LTFF, (...). Is that correct? Wouldn't at least a weak form / Pilot run of this be an attractive idea to them? I'd be surprised if there weren't any interest by these actors. If there weren't any interest, what was the reason (if not confidential)?

Thanks.

Greg_Colbourn ⏸️

People from those orgs were aware, but none were keen enough about the idea to go as far as attempting a pilot run (e.g. the 2 week retreat idea). I think general downside risk aversion was probably a factor. This was in the pre-chatGPT days of a much narrower Overton Window though, so maybe it's time for the idea to be revived? On the other hand, maybe it's much less needed now there is government involvement, and national AI Safety Institutes attracting top talent.

Greg_Colbourn ⏸️

Also, in general I'm personally much more sceptical of such a moonshot paying off, given shorter timelines and the possibility that x-safety from ASI may well be impossible. I think OP was 2022's best idea for AI Safety. 2024's is PauseAI.

GreenFrog

If you give me $250,000, I will work on AI alignment independently for a year. (Serious offer!) I'm not Terence Tao or Von Neumann, but I did graduate with a master's degree in CS/ML from Oxford so have at least some knowledge of ML. I think getting people like me to work on it is more realistic and potentially higher reward because you can get people who will actually work on it, rather than the slim chance of getting Terence Tao on board, and can have a greater number of hands to complete the tedious work and experiments that need to be done

Greg_Colbourn ⏸️

I think for that money you're going to need to prove that you're worth it - can you link to any of your work? Also, as per my note at the top of the OP, I think that there basically isn't time to spin up an alignment career now, so unless you are a genius or have some novel insights into the problem already, then I'm not very hopeful that your work could make a difference at this late stage. I'm more excited about people pivoting to work on getting a global AGI moratorium in place asap. Once we have that, then we can focus on a "Manhattan Project" for Alignment.

Comments

Recruit the World’s best for AGI Alignment

Recruit the World’s best for AGI Alignment

Summary:

What is the idea?

Where have I seen this before?

Isn’t this crazy?

(Why are your AI timelines short?

(What if my timelines aren’t short?

Ok, so maybe it’s not totally crazy. But how will we get the people that matter (i.e. the potential grantees) to take the idea seriously?

But the people targeted by this grant mostly aren’t interested in money..

What about their existing careers and the investments that went into those?

What about their existing employers?

But surely hedge funds and top companies have offered these people tons of money to work for them already?

How else can we turn (lots of) money into AGI Alignment work done by the very best and brightest?

Maybe we should just ask them what it would take to get them working on Alignment?

(If nerd-sniping is required, how can we do that?

In what ways is this different from other problems, like climate change?

Is Pure Maths ability what we want? How do we identify the World’s best talent for the job?

Aren’t most of the people who’ve won the biggest prizes (such as Nobels and Fields Medals) too old?

Aren’t we in danger of Goodharting on prior accomplishments here?

Seriously, “you cannot just pay $5 million apiece to a bunch of legible geniuses from other fields and expect to get great alignment work out of them.”(!)

Won’t they need a lot of time to get up to speed?

Won’t they need supervising?

What about publishing incentives?

How should this be organised?

Top AGI Safety researchers’ time is highly valuable, is this a good use of it?

Is hiring lone geniuses the way to go, or should we be thinking about finding people who work well together?

Where would this project be located?

Is this cost-effective?

Could this backfire?

How else could it go wrong?

But there could be good second-order effects too, right?

What should this be called?

How do we get it off the ground?

The TerryTaoDAO?

“Avengers assemble”?

Hold on, the goal we are aiming for is existential safety from AGI, and this is bigger than just Alignment. You mention governance above, what about that?