Bio

Participation
5

Hi, I'm Max :)

  • looking for work in AI governance (general strategy, expert surveys, research infrastructure, EU tech policy fellow)
  • background in cognitive science & biology (did research on metacognition)
  • most worried about AI going badly for technical & coordination reasons
  • vegan for the animals
  • doing my own forecasts: https://www.metaculus.com/accounts/profile/110500/

Comments
585

Topic contributions
2

(Just quick random thoughts.)

The more that Trump is perceived as a liability for the party, the more likely they would go along with an impeachment after a scandal.

  1. Reach out to Republicans in your state about your unhappiness about the recent behavior of the Trump administration.
  2. Financially support investigative reporting on the Trump administration.
  3. Go to protests?
  4. Comment on Twitter? On Truth Social?
    1. It's possibly underrated to write concise and common sense pushback in the Republican Twitter sphere?

I relate hard with the career struggles, thanks for sharing! :') Also very sweet (and again relatable) to drop everything for true love. :3

Thanks for writing this up, I think it's a really useful benchmark for tracking AI capabilities.

One minor feedback point, I feel like instead of reporting on statistical significance in the summary, I'd report on effect sizes, or maybe even better just put the discrimination plots in the summary as they give a very concrete and striking sense of the difference in performance. Statistical significance is affected by how many datapoints you have, which makes lack of a difference especially hard to interpret in terms of how real-world significant the difference is.

Most of my donations were forgone payments for hiring rounds at organizations I consider among the most promising at reducing risks from AI (e.g. Horizon and MIRI).

Thanks for sharing, I really appreciate your committment, and that you announce it.

Fwiw, my immediate reaction is that this type of protest might be a little too soon and it will cause more ridicule and backlash because the general public's and newsmedia's impression is that there is currently no immediate danger. Would be interested in learning more about the timing considerations. Like, I'd imagine that doing this barricading in the aftermath of some concrete harm happening would make favorable reporting for newsmedia much more likely, and then you could steer the discourse towards future and greater harms.

Cool, thanks for sharing!

We can sponsor US visas for technical roles

Does this apply to any of the roles you list here?

I love the new profiles, and also all the new formatting options for comments, plus the filtering for private notes. Thanks so much! :) 

Thanks so much for all your contributions Lizka! :) I really appreciated your presence on the forum, like a friendly, alive, and thoughtful soul that was attending to and helping grow this part of our ecosystem.

  • I can relate to the part about how unthankful it can be to be a mediator... it's a pretty interesting dynamic where something really useful is being disincentivized, would be interested in hearing more of your, or others' thoughts about it.
  • And I feel sorry that your work on moderation had negative effects on your interpersonal relationships. :| I don't know how that exactly looked like for you, but could also imagine an exploration into dynamics here might be pretty interested and potentially helpful to understand more widely.
  • The part about people being confused about CEA's role and activities, and the autonomy of the invidual teams, made me think that it might make sense to make the teams more prominent in comparison to CEA overall?
    • Like, giving teams more prominent names and brands (and cool logos!), emphasizing the teams more as the relevant entity that did something as opposed to CEA?

Thanks for doing this work, this seems like a particularly useful benchmark to track the world model of AI systems.

I found it pretty interesting to read the prompts you use, which are quite extensive and give a lot of useful structure to the reasoning. I was surprised to see in table 16 that the zero-shot prompts had almost the same performance level. The prompting kinda introduces a bunch of variance I imagine, and I wonder whether I should expect scaffolding (like https://futuresearch.ai/ are presumable focussing on) to cause significant improvements. 

Thanks, that all makes sense and moderates my optimism a bit, and it feels like we roughly exhausted the depth of my thinking. Sigh... anyways, I'm really thankful and maybe also optimistic for the work that dedicated and strategically thinking people like you have been and will be doing for animals.

Load more