I do independent research on EA topics. I write about whatever seems important, tractable, and interesting (to me).
I have a website: https://mdickens.me/ Much of the content on my website gets cross-posted to the EA Forum, but I also write about some non-EA stuff over there.
I used to work as a software developer at Affirm.
Mass third world immigration threatens advanced civilization - This is the most controversial point, but I think it is intuitive. Influxes of third worlders into Europe have caused a large amount of political and social unrest that has come to a head in the past decade and a half. Statistics from Denmark show that pretty much all third world migrants are a net negative on how much they contribute versus how much is spent on them by the state, while first worlders are a net positive.
Seems like you are cherry-picking one particular statistic about state welfare expenditures and ignoring all other relevant evidence. Most importantly, you are ignoring the consensus among economists that immigration increases economic prosperity.
If you will allow me to respond to your cherry-picked evidence with my own cherry-picked evidence: The United States consists of 97% immigrants or descendants of immigrants, and it's the wealthiest country in the world.
Robust alignment requires alignment-relevant intervention during pretraining
I'd say this is the wrong question. Like, I do not expect that any current alignment approach is going to work. If we do ever figure out what works, it will not look like "pretraining" or "post-training", it will be something completely different.
Although I guess you could call that "pretraining"?
Some commentary. I mostly agree with the page, but I will focus on the bits where I see room for improvement:
*this is pessimistic for donations but I would actually prefer that this happen because it would lengthen timelines. so in a way it's the optimistic outcome
I'd never thought of this argument, and it's obviously correct in retrospect.
Although "trying less hard" might not quite be pointing at the right thing. Reflecting on your epistemics / course of action could still be considered a form of "trying hard", so maybe it would be better to describe it as "rocketing in a particular pre-determined direction less hard".
Two quick thoughts:
I think section 10 is pointing at something similar. I find it at least somewhat plausible that RL on risk aversion generalizes better than other kinds of RL. I would still be surprised if we could get risk aversion to generalize to ASI using anything resembling current techniques, but this seems like a better-than-average idea for preventing AI takeover.