Non-EA interests include chess and TikTok (@benthamite). We are probably hiring: https://metr.org/hiring
Feedback always appreciated; feel free to email/DM me or use this link if you prefer to be anonymous.
Fair enough! My guess is that when the trend breaks it will be because things have gone super-exponential rather than sub-exponential (some discussion here) but yeah, I agree that this could happen!
Thanks for writing this up! I really like when people do concrete empirical surveys like this, it's helpful to get a sense of how widely current tools are actually being used.
I'm curious if you have thoughts about what automation would actually speed you up? It sounds like maybe something like "current LLMs but without hallucination?"
Also, do you have a sense for how much investment has been made into AI tools in CEST? My impression is that deepmind really loves getting into nature/science but has very little interest in actually commercializing these tools, so it feels not that surprising to me that the thing which got into science didn't actually get used.[1] It would update me if they tried very hard to commercialize it but failed.
I agree that this doesn't speak well of the editorial process though
It feels appropriate that this post has a lot of hearts and simultaneously disagree reacts. We will miss you, even (perhaps especially) those of us who often disagreed with you.
I would love to reflect with you on the other side of the singularity. If we make it through alive, I think there's a decent chance that it will be in part thanks to your work.
I was excited that they did this and thought it was well produced. The focus on cost cutting feels like a double edged sword: it absolves viewers of responsibility, which makes them more open to the message but also less likely to do anything. I scrolled through the first couple pages of comments and saw a bunch of "corporations are greedy" complaints but couldn't find anyone suggesting a concrete behavioral change (for themselves or others).
I wonder if there's an adjacent version of this which keeps the viewer absolved of responsibility but still has a call to action. Plausible ideas:
In any case, kudos to the Kurzgesagt team for making a video on this which (as of this writing) has 2M+ views!
If you can get a better score than our human subjects did on any of METR's RE-Bench evals, send it to me and we will fly you out for an onsite interview
Caveats:
(Crossposted from twitter.)
Thanks for all your work Joey! If it is the case that your counterfactual impact is lower now, it is coming down from a high place, because I have been impressed with AIM for a while and my impression is that you were pivotal in founding and running it.