There are critical gaps in the accessibility and affordability of mental health services worldwide: In some countries, you have to wait for years for therapy, in others you get at max one session per month covered by the health insurance, in others therapies for particular conditions like cluster B personality disorders are virtually nonexistent.
We want to leverage LLMs to fill these gaps and complement regular therapy. Our product is in development. We've based it on Gemini and want it to interface with widely used messaging apps, so users can interact with it like they would with a friend or coach.
I’ve previously founded or worked for several charities and spent a few years in earning to give for work on invertebrate welfare and s-risks from AI.
You can get up to speed on my thinking at Impartial Priorities.
Over the past 12 years, I almost always avoided applying for any jobs in effective altruism – though they did often seem like dream jobs – because:
The upshot for me was:
Meanwhile rejections were not a problem for me, so it's not really “rejection sensitivity.” I talked to my friends about how I'm expected to react to them, and their advice was helpful. If I had dared to apply for particularly responsible roles, a rejection would've been a relief. After all, rejection is a return to safety. It's just mixed with the shame over whatever mistakes I must've made in front of the interviewers. I considered not going to any conferences anymore where I might run into them, but my friends told me that's unnecessary. And it's true because when I decided not to hire people at my companies, I didn't want them to avoid me afterwards even if they've made mistakes in the interviews.
But more recently I've updated in the following ways:
Finally, for anyone struggling with similar difficulties in the face of overwhelming responsibility, here's a small example of someone processing his responsibility for a terrible accident.
Exactly. I care that the ideas in the post are good. Who does the actual “typing” is irrelevant for me.
And ghostwriting is nothing new. It's somewhat common practice for busy top researchers to verbally discuss topics with more junior researchers who still have a good grasp on what they're talking about and then for the junior researchers to write there verbal discussions down in the form of an easily digestable article.
Perhaps there are reasons why ghostwriting should be disclosed or not, but I don't see why ghostwriting by an AI is a special case that deserves more attention than human ghostwriting.
The purpose of the whole first article is to lay out all these perspectives and make mine transparent. Originally I wanted to just write the sequence, but Claude suggested to add a whole 'nother post in front to introduce people to the ethical foundations.
We've had the discussion of ethical systems on this article. One of my core values is cooperativeness, and while I intuitively started out as classic utilitarian, I found it at odds with many common and widely held value systems and not in ways where I would've been ready to bite the bullet. E.g., it maps to Quiverfull, which is a rather extreme outlier. I also don't bite the bullet on the Repugnant Conclusion.
But most people don't even think in terms of unbounded maximization or minimization, and certainly not in terms of classic utilitarianism. It's such a niche view out there. Even if I talk about implications that I actually like, like tiling the universe with blazingly fast, happy computations, people look at me weird. If I want to engage in moral trades with them, I have to adjust to a wholly different viewpoint. So it was probably around 2013–14 or so that I felt compelled to abandon classic utilitarianism as extreme, niche, and untenable.
I never settled on anything new after that but rather tried to figure out what the smallest common denominator is of the largest possible subset of all ethical preferences that I could think of.
The smallest common demoninator that is widely shared is that suffering is bad. Lots of people care about lots of other stuff on top, and sometimes it's not the most important thing for them, but most people seem to be in agreement that suffering is bad. That's also why I got so interested in sadism, an important exception from the rule.
So I think by focusing just on “suffering is bad, let's reduce it” (and the particular framing around preferences that I find more intuitive), I can make my articles relevant to the widest audience possible. The Quiverfulls will have quibbles with it, the sadists will hate it, but maybe some 90+% of the world will benefit from them.
You adapt them to the other species. We can do the same with much better information with our friends: One friend enjoys tomato sauce; another friend doesn't enjoy it. You want to get pizza for all three of you, and it's supposed to be a surprise, so instead of being like, “Pizza has tomato sauce on it, and my friend doesn't like it, so I can't make any inference about whether they want pizza and should [get pizza for them anyway, get pizza only for myself and the other friend],” you can make adjustments like, “Pizza has tomato sauce on it, and my friend doesn't like it, so let's ask whether they can make pizza without tomato sauce.”
E.g., I give this example of eusocial vs. solitary insects. So when I wonder whether it's stressful for an insect to be caught in my flat and not finding the way out (despite some nutritious stuff I have lying around), my guess is that it's more likely stressful for the eusocial one who wants to get back to their hive than for the solitary one. When I had a fly over for a week or so in 2021, I gave her a name, enjoyed her company, and didn't worry much about her feeling trapped (I did leave a window open when I was awake), but when I had a bee over a few days ago, I was more concerned and helped the bee find their way out.
Likewise, humans can have some 0–3 children, cows some 5–10, and cats some 60–100 over their lifetimes. They all parent and alloparent. Turtles more like 1–2k, and no (allo-) parenting. So (just based on this data) I imagine that the loss of a child is almost as bad for a cow as it is for a human, that cats grieve somewhat less, and turtles not at all.
Or when it comes to pain, I mention in my last article that it probably doesn't make sense to have very strong pain signals, when an animal cannot react to them. So for nematodes it's not very useful to experience pain strongly; for a fly it may be very useful.
In your thought experiment there is a disconnect there that is crucial for my work: Antifrustrationism, as a form of preference utilitarianism, is all about the preferences of the individuals. If someone can fulfill their own preferences, there's nothing for me to do. If someone can tell me how I can help them fulfill their preferences, I'll happily do things for them that strike me as odd, like unusual kinks, because even though I don't share them, I trust their ability to know and communicate their preferences.
I only run into problems in cases where there is a power differential but the beings cannot communicate their preferences, e.g., because they are in the future, far away in the universe, or it's hard/impossible to build the sorts of experiments where one could elicit revealed preferences. Those are the cases where it gets uncomfortable because I need to make guesses and inferences.
It's like with a caring mother of an infant who can't talk yet: The infant can't help themselves (yet), so she has to do it, but to do that well, she has to infer the preferences rather than asking what they are.
With the aliens, the situation is reversed. They are powerful but really dumb and could just ask how they can really help us, and NPP is not an obvious lever for a K-strategist species. But we have this problem with aging. Maybe they'll stop by Earth 100k years ago, largely fail to communicate with us, but study our biology and notice that this aging thing really sucks, especially in the last couple of decades of an animal's life. So just leave a time capsule with information on how to reverse aging that's designed in such a way that they hope we can decypher it once we have the necessary foundational technologies.
A confusing thing about my experience is that the truth of the no-self state is intellectually ineluctable to me and yet my perception is still filtered by selfhood. Sometimes I get a burst of spite-fueled energy that almost collapses into fatalism, and I remind myself that I chose this, that I can just go back if I don't like it, but that I want to keep reading this chapter of my life, as it were, because it's exciting! But even when I introspect, there's a perspective there – someone has called it the watcher?
My current practical ethics
The question often comes up how we should make decisions under epistemic uncertainty and normative diversity of opinion. Since I need to make such decisions every day, I had to develop a personal system, however inchoative, to assist me.
A concrete (or granite) pyramid
My personal system can be thought of like a pyramid.
The ground floor
The ground floor of principles and heuristics is really the most interesting part for anyone who has to act in the world, so I won't further explain the top two floors.
The principles and heuristics should be expected to be messy. That is, I think, because they are by necessity the result of an intersubjective process of negotiation and moral trade (positive-sum compromise) with all the other agents and their preferences. (This should probably include acausal moral trades like Evidential Cooperation in Large Worlds.)
It should also be expected to be messy because these principles and heuristics have to satisfy all sorts of awkward criteria:
Three types of freedom
But really that leaves us still a lot of freedom (for better or worse):
These suggest a particular stance toward other activists:
Very few examples
In my experience, principles and heuristics are best identified by chatting with friends and generalizing from their various intuitions.
Various non-consequentialist ethical theories can come in handy here to generate further useful principles and heuristics. That is probably because they are attempts at generalizing from the intuitions of certain authors, which puts them almost on par (to the extent to which these authors are relateable to you) with generalizations from the intuitions of your friends.
(If you find my writing style hard to read, you can ask Claude to rephrase the message into a style that works for you.)