
Vasco Grilo

5622 karma · Working (0-5 years) · Lisbon, Portugal
sites.google.com/view/vascogrilo?usp=sharing

Bio

Participation
4

How others can help me

You can give me feedback here (anonymous or not). You are welcome to answer any of the following:

  • Do you have any thoughts on the value (or lack thereof) of my posts?
  • Do you have any ideas for posts you think I would like to write?
  • Are there any opportunities you think would be a good fit for me which are either not listed on 80,000 Hours' job board, or are listed there but you suspect I might be underrating?

How I can help others

Feel free to check my posts, and see if we can collaborate to contribute to a better world. I am open to part-time volunteering, and part-time or full-time paid work. For paid work, I typically ask for 20 $/h, which is roughly 2 times the global real GDP per capita.

Comments
1298

Topic contributions
25

Thanks for the clarification, Erich! Strongly upvoted.

Let me see if I can rephrase your argument

I think your rephrasing was great.

Now I'm a bit unsure about whether you're saying that you find it extremely unlikely that any AI will be vastly better in the areas I mentioned than all humans, or that you find it extremely unlikely that any AI will be vastly better than all humans and all other AIs in those areas.

The latter.

If you mean 1-4 to suggest that no AI will be better than all humans and other AIs, I'm not sure whether 4 follows from 1-3, but I think that seems plausible at least. But if this is what you mean, I'm not sure what your original comment ("Note humans are also trained on all those abilities, but no single human is trained to be a specialist in all those areas. Likewise for AIs.") was meant to say in response to my original comment, which was meant as pushback against the view that AGI would be bad at taking over the planet since it wouldn't be intended for that purpose.

I think a single AI agent would have to be better than the vast majority of agents (including both human and AI agents) to gain control over the world, which I consider extremely unlikely given gains from specialisation.

If you mean 1-4 to suggest that no AI will be better than all humans, I don't think the analogy holds, because the underlying factor (IQ versus AI scale/algorithms) is different. Like, it seems possible that even unspecialized AIs could just sweep past the most intelligent and specialized humans, given enough time.

I agree.

I'd be curious to hear if you have thoughts about which specific abilities you expect an AGI would need to have to take control over humanity that it's unlikely to actually possess?

I believe the probability of a rogue (human or AI) agent gaining control over the world mostly depends on its level of capabilities relative to those of the other agents, not on the absolute level of capabilities of the rogue agent. So I mostly worry about concentration of capabilities rather than increases in capabilities per se. In theory, the capabilities of a given group of (human or AI) agents could increase a lot in a short period of time such that capabilities become so concentrated that the group would be in a position to gain control over the world. However, I think this is very unlikely in practice. I guess the annual probability of human extinction over the next 10 years is around 10^-6.

Hi titotal,

I think it makes sense to assess the annual risk of simulation shutdown based on the mean annual probability of simulation shutdown. However, I also believe the risk of simulation shutdown is much lower than the one guessed by the humble cosmologist.

The mean of a loguniform distribution ranging from a to 1 is (1 - a)/ln(1/a), which is roughly -1/ln(a) for small a. If a = 10^-100, the risk is 0.434 % (= -1/ln(10^-100)). However, I assume there is no reason to set the minimum risk to 10^-100, so the cosmologist may actually have been overconfident. Since there is no obvious natural lower bound for the risk, because more or less by definition we do not have evidence about the simulators, I guess the lower bound can be arbitrarily close to 0. In this case, the mean of the loguniform distribution goes to 0 (since -1/ln(a) tends to 0 as a tends to 0), so it looks like the humblest view corresponds to 0 risk of simulation shutdown.
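As a quick sanity check of these numbers, here is a minimal Python sketch (the function and variable names are mine, just for illustration):

```python
import math

def loguniform_mean(a: float) -> float:
    """Mean of a loguniform distribution on [a, 1]: (1 - a) / ln(1 / a)."""
    return (1 - a) / math.log(1 / a)

# With the cosmologist's lower bound of 10^-100:
print(loguniform_mean(1e-100))  # ~0.00434, i.e. roughly 0.434 %

# For a = 10^-k with large k, the mean is about 1 / (k * ln 10),
# so it tends to 0 as the lower bound does.
for k in (100, 1_000, 10_000):
    print(k, 1 / (k * math.log(10)))
```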

In addition, the probability of surviving an annual risk of simulation shutdown of 0.434 % over the estimated age of the universe of 13.8 billion years is only about 10^-26,000,000 (= (1 - 0.00434)^(13.8*10^9)), which is basically 0. So the universe would need to be super super lucky in order to have survived for so long with such high risk. One can try to counter this argument by saying there are selection effects. However, it would be super strange to have an annual risk of simulation shutdown of 0.434 % without any partial shutdowns, given that tail risk usually follows something like a power law[1] without severe jumps in severity.

  1. ^

    Although I think tail risk often decays faster than suggested by a power law.
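A minimal sketch of that survival calculation in Python (computed in log space, since the probability itself underflows a float):

```python
import math

ANNUAL_RISK = 0.00434        # 0.434 % mean annual shutdown risk
AGE_OF_UNIVERSE = 13.8e9     # years

# log10 of the probability of no shutdown in 13.8 billion consecutive years.
log10_survival = AGE_OF_UNIVERSE * math.log10(1 - ANNUAL_RISK)
print(log10_survival)  # ~ -2.6e7, i.e. a survival probability of ~ 10^-26,000,000
```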

Nice post, titotal!

This could be a whole post in itself, and in fact I’ve already explored it a bit here.

The link is private.

I would expect improvements on these types of tasks to be highly correlated in general-purpose AIs.

Higher IQ in humans is correlated with better performance in all sorts of tasks too, but the probability of finding a single human performing better than 99.9 % of (human or AI) workers in each of the areas you mentioned is still astronomically low. So I do not expect a single AI system to become better than 99.9 % of (human or AI) workers in each of those areas. It can still be the case that AI systems share a common baseline architecture, in the same way that humans share the same underlying biology, but I predict the top performers in each area will still be specialised systems.

I think we've seen that with GPT-3 to GPT-4, for example: GPT-4 got better pretty much across the board (excluding the tasks that neither of them can do, and the tasks that GPT-3 could already do perfectly). That is not the case for a human who will typically improve in just one domain or a few domains from one year to the next, depending on where they focus their effort.

Going from GPT-3 to GPT-4 seems more analogous to a human going from 10 to 20 years old. There are improvements across the board during this phase, but specialisation still matters among adults. Likewise, I assume specialisation will matter among frontier AI systems (although I am quite open to a single future AI system being better than all humans at any task). GPT-4 is still far from being better than 99.9 % of (human or AI) workers in the areas you mentioned.

For an agent to conquer the world, I think it would have to be close to the best across all those areas, but I consider this super unlikely, given that it is super unlikely for a human to be close to the best across all those areas.

Hi Erich,

Note humans are also trained on all those abilities, but no single human is trained to be a specialist in all those areas. Likewise for AIs.

Hi JP,

It would be nice to be able to filter a user's posts by a given tag. As of now, this is not possible.

For reference, here is a seemingly nice summary of Fearon's "Rationalist explanations for war" by David Patel.

Nice point, Robi! That being said, it seems to me that having many value handshakes correlated with what humans want is not too different from historical generational changes within the human species.
