Co-Director of Equilibria Network: https://eq-network.org/
I try to write as if I were having a conversation with you in person.
I would like to claim that my current safety beliefs are a mix of Paul Christiano's, Andrew Critch's, and Def/Acc's.
Some people might find that this post is written from a place of agitation, which is fully okay. Even if you do, there are two things I would want to point out as really good points:
I think there's a very, very interesting project in democratizing the EA community in a way that makes it more effective. There are lots of institutional designs we could apply to ourselves, and I would be very excited to see more work in this direction!
Edit:
Clarification on why I believe it causes some agitation for some people:
This isn’t just a technical issue. This is a design philosophy — one that rewards orthodoxy, punishes dissent, and enforces existing hierarchies.
I liked the post, I think it made a good point, and I strong-upvoted it, but I wanted to mention this as a caveat.
I felt that this post might be relevant to longtermism and person-affecting views, so I had Claude write up a quick report on that:
In short: Rejecting the SWWM 💸11% pledge's EV calculation logically commits you to person-affecting views, effectively transforming you from a longtermist into a neartermist.
Example: Bob rejects investing in a $500 ergonomic chair despite the calculation showing 10^50 * 1.2*10^-49 = 12 lives saved due to "uncertainty in the probabilities." Yet Bob still identifies as a longtermist who believes we should value future generations. This is inconsistent, as longtermism fundamentally relies on the same expected value calculations with uncertain probabilities that SWWM uses.
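For concreteness, here is the arithmetic the example leans on, as a minimal sketch in Python (the population size and probability are the satirical figures from the example above, not real estimates):

```python
# Expected value of Bob's chair purchase, using the example's satirical numbers.
future_lives = 10 ** 50        # assumed far-future population from the example
p_chair_is_decisive = 1.2e-49  # assumed probability that the chair makes the difference

expected_lives_saved = future_lives * p_chair_is_decisive
print(expected_lives_saved)    # ~12.0 lives saved in expectation
```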
The 🔮 Badge
If you've rejected the SWWM 💸11% Pledge while maintaining longtermist views, we'd appreciate it if you could add the 🔮 "crystal ball" emoji to your social media profiles to signal your epistemic inconsistency.
FAQ
Why can't I reject SWWM but stay a longtermist?
Both longtermism and SWWM rely on the same decision-theoretic framework of accepting tiny probabilities of affecting vast future populations. Our analysis shows the error bars in SWWM calculations (±0.0000000000000000000000000000000000000000000001%) are actually narrower than the error bars in most longtermist calculations.
What alternatives do I have?
According to our comprehensive Fermi estimate, maintaining consistency between your views on SWWM and longtermism is approximately 4.2x more philosophically respectable.
First and foremost, I'm low confidence here.
I will focus on x-risk from AI, and I will challenge the premise that this is the right way to ask the question.
What is the difference between x-risk and s-risk/increasing the value of futures? When we mention x-risk with regard to AI, we think of humans going extinct, but I believe that to be shorthand for wise, compassionate decision-making (at least in the EA sphere).
Personally, I think that x-risk and good decision-making in terms of moral value might be coupled to each other. We can think of our current governance structures a bit like correction systems for individual errors. If errors pile up, we go off the rails and increase x-risk as well as the chances of a bad future.
So a good decision-making system should account for both x-risk and value estimation; therefore the solution is the same, and it is a false dichotomy?
(I might be wrong and I appreciate the slider question anyway!)
First and foremost, I agree with the point. I think looking at this through the lens of transformative AI in particular might be interesting. (Coincidentally, this is something I'm currently doing using agent-based models (ABMs) with LLMs.)
You probably know this one but here's a link to a cool project: https://effectiveinstitutionsproject.org/
Dropping some links below. I've been working on this with a couple of people in Sweden for the last two years; we're building an open-source platform for better democratic decision-making using prediction markets:
https://digitaldemocracy.world/flowback-the-future-of-democracy/
The people I'm working with there are also working on:
I know the general space here, so if anyone is curious, I'm happy to link to people doing different things!
You might also want to check out:
I guess a random thought I have here is that you would probably want video, and you would probably want it to be pretty spammable so that you have many shots at it. Looking at Twitter, we already see large numbers of bots commenting on things, which is essentially a text deepfake.
I can see that in a year or so, when Sora is good enough that creating a short-form, stable video is easy, we will see a lot more manipulation of voters through deepfakes on various social media.
(I don't think the tech is easy enough to use yet for this to be painless, even though it is possible. I spent a couple of hours trying to set this up for a showcase once; you had to do some fine-tuning and training, and there was no plug-and-play option, which is probably a bottleneck for now.)
FWIW, I find that if you analyze places where we've successfully aligned things in the past (social systems, biology, etc.), the 1st and 2nd types of alignment really don't break down in that way.
After doing Agent Foundations for a while, I'm just really against the alignment frame, and I'm personally hoping that more research in this direction will happen so that we get more evidence that other types of solutions are needed (e.g. alignment of complex systems, as has happened in biology and social systems in the past).
FWIW, I completely agree with what you're saying here. I think that if you go seriously into consciousness research, especially into what we Westerners would label a sense of self rather than anything else, it quickly becomes infeasible to hold the position that the direction we're taking AI development, e.g. towards AI agents, will not lead to AIs having self-models.
For all intents and purposes, this encompasses most physicalist or non-dual theories of consciousness, which are the only feasible ones unless you want to bite some really sour apples.
There's a classic "what are we getting wrong?" question in EA, and I think it's extremely likely that we will look back in 10 years and say, "wow, what were we doing here?"
I think it's a lot better to think in terms of systemic alignment: look at the properties we want for the general collective intelligences we participate in, such as our information networks or our institutional decision-making procedures, and think about how we can optimise these for resilience and truth-seeking. If certain AIs deserve moral patienthood, then that truth will naturally arise from such structures.
(hot take) Individual AI alignment might honestly be counter-productive towards this view.
I'm not a career counsellor, so take everything with a grain of salt, but you did publicly post this inviting unsolicited advice, so here you go!
More directly: if you're thinking of EA as a community that needs specific skills and you're wondering what to do, your people-management, strategy, and general leadership skills are likely to be in high demand from other organisations: https://forum.effectivealtruism.org/posts/LoGBdHoovs4GxeBbF/meta-coordination-forum-2024-talent-need-survey
Someone else mentioned that enjoyment can be highly organisation-specific, and even specific to the stage of the organisation.
My thought is something like:
Those are some random thoughts, best of luck to you!
So I'll just report on a vibe I've been feeling on the forum.
I feel a lot more comfortable posting on LessWrong compared to the EA Forum because it feels like there's a lot more moral outrage here. If I go back three or four years, I felt that the forum was a lot more open to discussing and exploring new ideas. There have been some controversies recently around the meat-eater problem and similar topics, and I can't help but feel uncomfortable posting with how people have started to react.
I like the different debate weeks, as I think they set up a specific context for creating more content, which is quite great. Maybe it's a vibe thing, maybe it's something else, but I feel that the virtue of open-hearted truth-seeking is missing compared to a couple of years back, and it makes me want to avoid posting.
I do believe that the bar for posting should be lowered at least a bit and that things should become more exploratory again. So, uhm, more events that invite community writing and engagement?
This is very nice!
I've been thinking that there's a nice, generalisable analogy between Bayesian updating and forecasting. (It's fairly obvious when you think about it, but it feels like people aren't exploiting it?)
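To make the analogy concrete, here's a minimal sketch (the numbers are illustrative, not from any real forecasting question): a forecaster revising their probability in light of new evidence is just performing a Bayesian update in odds form.

```python
# A forecaster revising a probability, written as a Bayesian update in odds form.
def bayes_update(prior: float, likelihood_ratio: float) -> float:
    """Posterior probability from a prior and a likelihood ratio
    P(evidence | hypothesis) / P(evidence | not hypothesis)."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * likelihood_ratio
    return posterior_odds / (1 + posterior_odds)

forecast = 0.30                       # initial forecast that some event happens
forecast = bayes_update(forecast, 4)  # new evidence is 4x likelier if it does
print(round(forecast, 3))             # -> 0.632
```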
I'm doing a project simulating a version of this idea, but in a way that uses democratic decision-making, called Predictive Liquid Democracy (PLD), and I would love to hear any thoughts you have on the general setup. It is model parameterization, but within a specific democratic framing.
PLD is basically saying the following:
What if we could set up a trust-based, meritocratic voting network built on predictions about how well a candidate will perform? It is futarchy with some twists.
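As a toy illustration of the kind of mechanism I mean (the aggregation rule, weights, and names below are my own simplifying assumptions for the sketch, not the actual PLD/FlowBack specification):

```python
# Toy sketch of Predictive Liquid Democracy: voters either forecast candidate
# performance themselves or delegate to a forecaster, and each forecaster's
# influence combines delegated votes with their historical accuracy.
# All numbers and the aggregation rule are illustrative assumptions.

track_record = {"ana": 0.9, "bo": 0.6}  # e.g. 1 - mean Brier score, in [0, 1]

# Each voter delegates their single vote to a forecaster (possibly themselves).
delegations = {"v1": "ana", "v2": "ana", "v3": "bo", "ana": "ana", "bo": "bo"}

# Forecasters' predicted performance of each candidate, in [0, 1].
predictions = {
    "ana": {"candidate_x": 0.8, "candidate_y": 0.4},
    "bo": {"candidate_x": 0.3, "candidate_y": 0.7},
}

def score_candidates(delegations, predictions, track_record):
    """Weight each forecaster by delegated votes times track record, then
    aggregate their predictions into a score per candidate."""
    weight = {forecaster: 0.0 for forecaster in predictions}
    for _voter, delegate in delegations.items():
        weight[delegate] += track_record[delegate]
    total_weight = sum(weight.values())
    candidates = next(iter(predictions.values())).keys()
    return {
        c: sum(weight[f] * predictions[f][c] for f in predictions) / total_weight
        for c in candidates
    }

print(score_candidates(delegations, predictions, track_record))
# -> candidate_x scores higher, because the better-calibrated forecaster with
#    more delegated weight predicts stronger performance for it.
```

The design choice in this sketch is that a delegation is only worth as much as the delegate's demonstrated forecasting accuracy, which is the "trust-based, meritocratic" part; the real system presumably handles scoring and delegation chains in a more principled way.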
As for the generalised framing in terms of graphs that I'm thinking of: I'm writing a paper setting up the variational mathematics behind it right now. I'm also writing a paper on some more specific simulations of this to run, so I'm very grateful for any thoughts you might have on this setup!