All of Anthony DiGiovanni's Comments + Replies

My understanding is that:

  1. Spite (as a preference we might want to reduce in AIs) has just been relatively well-studied compared to other malevolent preferences. If this subfield of AI safety were more mature there might be less emphasis on spite in particular.
  2. (Less confident, haven't thought that much about this:) It seems conceptually more straightforward what sorts of training environments are conducive to spite, compared to fanaticism (or fussiness or little-to-lose, for that matter).
5
David_Althaus
1mo
Thanks Anthony! Regarding 2: I'm no expert at all, but it seems to me that there are other ways of influencing the preferences/dispositions of AI—e.g., i) penalizing, say, malevolent or fanatical reasoning/behavior/attitudes (e.g., by telling RLHF raters to specifically look out for such properties and penalize them), or ii) similarly amending the principles and rules of constitutional AI.

Thanks for asking — you can read more about these two sources of s-risk in Section 3.2 of our new intro to s-risks article. (We also discuss "near miss" there, but our current best guess is that such scenarios are significantly less likely than other s-risks of comparable scale.)

I agree with your reasoning here—while I think working on s-risks from AI conflict is a top priority, I wouldn't give Dawn's argument for it. This post gives the main arguments for why some "rational" AIs wouldn't avoid conflicts by default, and some high-level ways we could steer AIs into the subset that would.

2
Dawn Drescher
1y
Agreed, and thanks for linking the article!

I've found this super useful over the past several months—thanks!

1
WobblyPanda2
10mo
Am I alone in finding the text-to-speech voice really unpleasant? I can recommend, instead, Freya's voice and pacing at: https://www.naturalreaders.com/online/ There's even text-to-speech with a really natural sounding Snoop Dogg voice by a different company. These are all way more engaging options.

Given that you can just keep doing better and better essentially indefinitely, and that GPT is not anywhere near the upper limit, talking about the difficulty of the task isn't super meaningful.

I don't understand this claim. Why would the difficulty of the task not be super meaningful when training to performance that isn't near the upper limit?

As an analogy: consider a variant of rock paper scissors where you get to see your opponent's move in advance—but it's encrypted with RSA. In some sense this game is much harder than proving Fermat's last theorem, since playing optimally requires breaking the encryption scheme. But if you train a policy and find that it wins 33% of the time at encrypted rock paper scissors, it's not super meaningful or interesting to say that the task is super hard, and in the relevant intuitive sense it's an easier task than proving Fermat's last theorem.

In "Against neutrality...," he notes that he's not arguing for a moral duty to create happy people, and it's just good "others things equal." But, given that the moral question under opportunity costs is what practically matters, what are his thoughts on this view?: "Even if creating happy lives is good in some (say) aesthetic sense, relieving suffering has moral priority when you have to choose between these." E.g., does he have any sympathy for the intuition that, if you could either press a button that treats someone's migraine for a day or one that cre... (read more)

I am (clearly) not Tobias, but I'd expect many people familiar with EA and LW would get something new out of Ch 2, 4, 5, and 7-11. Of these, seems like the latter half of 5, 9, and 11 would be especially novel if you're already familiar with the basics of s-risks along the lines of the intro resources that CRS and CLR have published. I think the content of 7 and 10 is sufficiently crucial that it's probably worth reading even if you've checked out those older resources, despite some overlap.

7
Tobias_Baumann
1y
I agree with this answer. That's great, because that is also the starting point of my book. From the introduction: That is, I'm not dwelling on an argument for these fundamental values, as that can be found elsewhere.

Anecdote: My grad school personal statement mentioned "Concrete Problems in AI Safety" and Superintelligence, though at a fairly vague level about the risks of distributional shift or the like. I got into some pretty respectable programs. I wouldn't take this as strong evidence, of course.

I'm fine with other phrasings and am also concerned about value lock-in and s-risks, though I think these can be thought of as a class of x-risks

I'm not keen on classifying s-risks as x-risks because, for better or worse, most people really just seem to mean "extinction or permanent human disempowerment" when they talk about "x-risks." I worry that a motte-and-bailey can happen here, where (1) people include s-risks within x-risks when trying to get people on board with focusing on x-risks, but then (2) their further discussion of x-risks basically equates them with non-s-x-risks. The fact that the "dictionary definition" of x-risks would include s-risks doesn't solve this problem.

I think this is a valid concern. Separately, it's not clear that all s-risks are x-risks, depending on how "astronomical suffering" and "human potential" are understood.

What do you think about the concept of a hellish existential catastrophe? It highlights both that (some) s-risks fall under the category of existential risk and that they have an additional important property absent from typical x-risks. The concept isolates a risk the reduction of which should arguably be prioritized by EAs with different moral perspectives.

e.g. 2 minds with equally passionate complete enthusiasm  (with no contrary psychological processes or internal currencies to provide reference points) respectively for and against their own experience, or gratitude and anger for their birth (past or future).  They  can respectively consider a world with and without their existences completely unbearable and beyond compensation. But if we're in the business of helping others for their own sakes rather than ours, I don't see the case for excluding either one's concern from our moral circle.


... (read more)
9
Brian_Tomasik
2y
What if the individual says that after thinking very deeply about it, they believe their existence genuinely is much better than not having existed? If we're trying to be altruistic toward their own values, presumably we should also value their existence as better than nothingness (unless we think they're mistaken)? One could say that if they don't currently exist, then their nonexistence isn't a problem. It's true that their nonexistence doesn't cause suffering, but it does make impartial-altruistic total value lower than otherwise if we would consider their existence to be positive.

...Having said that, I do think the "deeper intuition that the existing Ann must in some way come before need-not-ever-exist-at-all Ben" plausibly boils down to some kind of antifrustrationist or tranquilist intuition. Ann comes first because she has actual preferences (/experiences of desire) that get violated when she's deprived of happiness. Not creating Ben doesn't violate any preferences of Ben's.

4
Richard Y Chappell
2y
I don't think so.  I'm sure that Roberts would, for example, think we had more reason to give Ann a lollipop than to bring Ben into existence and give him one, even if Ann would not in any way be frustrated by the lack of a lollipop. The far more natural explanation is just that we have person-directed reasons to want what is good for Ann, in addition to the impersonal reasons we have to want a better world (realizable by either benefiting Ann or creating & benefiting Ben).

certainly don't reflect the kinds of concerns expressed by Setiya that I was responding to in the OP

I agree. I happen to agree with you that the attempts to accommodate the procreation asymmetry without lexically disvaluing suffering don't hold up to scrutiny. Setiya's critique missed the mark pretty hard, e.g. this part just completely ignores that this view violates transitivity:

But the argument is flawed. Neutrality says that having a child with a good enough life is on a par with staying childless, not that the outcome in which you have a child is equa

... (read more)
4
Richard Y Chappell
2y
In fairness to Setiya, the whole point of parity relations (as developed, with some sophistication, by Ruth Chang) is that they -- unlike traditional value relations -- are not meant to be transitive.  If you're not familiar with the idea, I sketch a rough intro here.
7
Anthony DiGiovanni
2y
...Having said that, I do think the "deeper intuition that the existing Ann must in some way come before need-not-ever-exist-at-all Ben" plausibly boils down to some kind of antifrustrationist or tranquilist intuition. Ann comes first because she has actual preferences (/experiences of desire) that get violated when she's deprived of happiness. Not creating Ben doesn't violate any preferences of Ben's.

appeal to some form of partiality or personal prerogative seems much more appropriate to me than denying the value of the beneficiaries

I don't think this solves the problem, at least if one has the intuition (as I do) that it's not the current existence of the people who are extremely harmed to produce happy lives that makes this tradeoff "very repugnant." It doesn't seem any more palatable to allow arbitrarily many people in the long-term future (rather than the present) to suffer for the sake of sufficiently many more added happy lives. Even if those liv... (read more)

5
Richard Y Chappell
2y
Interesting!  Yeah, a committed anti-natalist who regrets all of existence -- even in an "approximate utopia" -- on the grounds that even a small proportion of very unhappy lives automatically trumps the positive value of a world mostly containing overwhelmingly wonderful, flourishing lives  is, IMO, in the grips of... um (trying to word this delicately)... values I strongly disagree with.  We will just have very persistent disagreements, in that case! FWIW, I think those extreme anti-natalist values are unusual, and certainly don't reflect the kinds of concerns expressed by Setiya that I was responding to in the OP (or other common views in the vicinity, e.g. Melinda Roberts' "deeper intuition that the existing Ann must in some way come before need-not-ever-exist-at-all Ben").
I think such views have major problems, but I don’t talk about those problems in the book. (Briefly: If you think that any X outweighs any Y, then you seem forced to believe that any probability of X, no matter how tiny, outweighs any Y. So: you can either prevent a one in a trillion trillion trillion chance of someone with a suffering life coming into existence, or guarantee a trillion lives of bliss. The lexical view says you should do the former. This seems wrong, and I think doesn’t hold up under moral uncertainty, either. There are ways of avo
... (read more)
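A minimal sketch of the probabilistic step in the quoted argument, under the assumption (mine, purely for illustration; the extension rule and the specific numbers are not from the comment) that a lexical ordering is applied to risky prospects by comparing expected suffering first and expected happiness second:

```latex
% Represent a prospect by (E[S], E[H]): expected lexically-bad suffering and
% expected happiness, compared lexicographically (less expected suffering wins
% outright; happiness only breaks ties).
%
% Option A: prevent a 1-in-10^{36} chance of one suffering life coming into existence
%   E[S_A] = 0,        E[H_A] = 0
% Option B: leave that tiny risk in place and guarantee 10^{12} lives of bliss
%   E[S_B] = 10^{-36}, E[H_B] = 10^{12}
%
% Lexical comparison: E[S_A] = 0 < 10^{-36} = E[S_B], so A is ranked above B
% no matter how large E[H_B] gets, which is the verdict the passage objects to.
\[
  A \succ B \iff \mathbb{E}[S_A] < \mathbb{E}[S_B]
  \;\lor\; \bigl(\mathbb{E}[S_A] = \mathbb{E}[S_B] \wedge \mathbb{E}[H_A] > \mathbb{E}[H_B]\bigr)
\]
```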
The Asymmetry endorses neutrality about bringing into existence lives that have positive wellbeing, and I argue against this view for much of the population ethics chapter, in the sections “The Intuition of Neutrality”,  “Clumsy Gods: The Fragility of Identity”, and “Why the Intuition of Neutrality is Wrong”.

You seem to be using a different definition of the Asymmetry than Magnus is, and I'm not sure it's a much more common one. On Magnus's definition (which is also used by e.g. Chappell; Holtug, Nils (2004), "Person-affecting Moralities"; and ... (read more)

3
Evžen
1y
If bringing into existence lives that have positive wellbeing is at best neutral (and presumably strongly negative for lives with negative wellbeing) — why have children at all? Is it the instrumental value they bring during their lives that we're after under this philosophy? (Sorry, I'm almost surely missing something very basic here — not a philosopher.)
Are you saying that from your and Teo's POVs, there's a way to 'improve a mental state' that doesn't amount to decreasing suffering (/preventing it)?

No, that's precisely what I'm denying. So, the reason I mentioned that "arbitrary" view was that I thought Jack might be conflating my/Teo's view with one that (1) agrees that happiness intrinsically improves a mental state, but (2) denies that improving a mental state in this particular way is good (while improving a mental state via suffering-reduction is good).

Such an understanding seems plausible in a s
... (read more)
1
Dan Hageman
2y
Okay, thanks for clarifying! I think I was confused by that opening line: you clarified that your views do not say that only a relief of suffering improves a mental state, but in reality you do think such is the case, just not in conjunction with the claim that happiness also intrinsically improves a mental state, correct?

> Analogously, you can increase the complexity and artistic sophistication of some painting, say, but if no one ever observes it (which I'm comparing to no one suffering from the lack of more intense happiness), there's no "improvement" to the painting.

With respect to this, I should have clarified that the state of contentment that becomes a more intense positive state was that of an existing and experiencing being, not a content state of non-existence into which pleasure is then brought. Given this, would the painting analogy hold, since in this thought experiment there is an experiencer who has some sort of improvement in their mental state, albeit not a categorical sort of improvement on par with the sort that relieves suffering? I.e., it wasn't a problem per se (no suffering) that they were being deprived of the more intense pleasure, but the move from lower pleasure to higher pleasure is still an improvement in some way (albeit perhaps a better word would be needed to distinguish the lexical difference in importance between these sorts of *improvements*).

Some things I liked about What We Owe the Future, despite my disagreements with the treatment of value asymmetries:

  • The thought experiment of imagining that you live one big super-life composed of all sentient beings’ experiences is cool, as a way of probing moral intuitions. (I'd say this kind of thought experiment is the core of ethics.)
    • It seems better than e.g. Rawls’ veil of ignorance because living all lives (1) makes it more salient that the possibly rare extreme experiences of some lives still exist even if you're (un)lucky enough not to go through t
... (read more)

I think one crux here is that Teo and I would say, calling an increase in the intensity of a happy experience "improving one's mental state" is a substantive philosophical claim. The kind of view we're defending does not say something like, "Improvements of one's mental state are only good if they relieve suffering." I would agree that that sounds kind of arbitrary.

The more defensible alternative is that replacing contentment (or absence of any experience) with increasingly intense happiness / meaning / love is not itself an improvement in mental state. And this follows from intuitions like "If a mind doesn't experience a need for change (and won't do so in the future), what is there to improve?"

1
Dan Hageman
2y
Can you elaborate a bit on why the seemingly arbitrary view you quoted in your first paragraph wouldn't follow from the view that you and Teo are defending? Are you saying that from your and Teo's POVs, there's a way to 'improve a mental state' that doesn't amount to decreasing suffering (/preventing it)? The statement itself seems a bit odd, since 'improvement' seems to imply 'goodness', yet the statement hypothetically considers situations where improvements may not be good, so I thought I would see if you could clarify. In regards to the 'defensible alternative', it seems one could defend a plausible view that a state of contentment, moved to a state of increased bliss, is indeed an improvement, even though there wasn't a need for change. Such an understanding seems plausible in a self-intimating way when one valence state transitions to the next, insofar as we concede that there are states of more or less pleasure outside of negatively valenced states. It seems one could hold this while maintaining that such improvements are never capable of outweighing the mitigation of problematic, suffering states. (Note: using the term "improvement" can easily lead to accidental equivocation between scenarios of mitigating suffering versus increasing pleasure, but the ethical distinction between the two seems manageable.)
Is it thought experiments such as the ones Magnus has put forward? I think these argue that alleviating suffering is more pressing than creating happiness, but I don't think these argue that creating happiness isn't good.

I think they do argue that creating happiness isn't intrinsically good, because you can always construct a version of the Very Repugnant Conclusion that applies to a view that says suffering is weighed some finite X times more than happiness, and I find those versions almost as repugnant. E.g. suppose that on classical utilitarianism we... (read more)

5
JackM
2y
“I have basically no sympathy to the perspective that considers the pain intrinsically necessary in this scenario, or any scenario.” I wasn’t expecting you to. I don’t have any sympathy for it either! I was just giving you an argument that I suspect many others would find compelling. Certainly if my sister died and I didn’t feel anything, my parents wouldn’t like that! Maybe it’s not particularly relevant to you if an argument is considered compelling by others, but I wanted to raise it just in case. I certainly don’t expect to change your mind on this - nor do I want to as I also think suffering is bad! I’m just not sure suffering being bad is a smaller leap than saying happiness is good.

This only applies to flavors of the Asymmetry that treat happiness as intrinsically valuable, such that you would pay to add happiness to a "neutral" life (without relieving any suffering by doing so). If the reason you don't consider it good to create new lives with more happiness than suffering is that you don't think happiness is intrinsically valuable, at least not at the price of increasing suffering, then you can't get Dutch booked this way. See this comment.

I didn't directly respond to the other one because the principle is exactly the same. I'm puzzled that you think otherwise.

Removing their sadness at separation while leaving their desire to be together intact isn't a clear Pareto improvement unless one already accepts that pain is what is bad.

I mean, in thought experiments like this all one can hope for is to probe intuitions that you either do or don't have. It's not question-begging on my part because my point is: Imagine that you can remove the cow's suffering but leave everything else practically the... (read more)

3
Hank_B
2y
I think my above reply missed the mark here. Sticking with the cow example, I agree with you that if we removed their pain at being separated while leaving the desire to be together intact, this seems like a Pareto improvement over not removing their pain.

A preferentist would insist here that the removal of pain is not what makes that situation better, but rather that pain is (probably) dispreferred by the cows, so removing it gives them something they want.

But the negative hedonist (pain is bad, pleasure is neutral) is stuck with saying that the "drugged into happiness" image is as good as the "cows happily reunited" image. A preferentist by contrast can (I think intuitively) assert that reuniting the cows is better than just removing their pain, because reunification fulfills (1) the cows' desire to be free of pain and (2) their desire to be together.

Here's another way of saying my objection to your original comment: What makes "happiness is intrinsically good" more of an axiom than "sufficiently intense suffering is morally serious in a sense that happiness (of the sort that doesn't relieve any suffering) isn't, so the latter can't compensate for the former"? I don't see what answer you can give that doesn't appeal to intuitions about cases.

  • That case does run counter to "suffering is intrinsically bad but happiness isn't," but it doesn't run counter to "suffering is bad," which is what your last comment asked about. I don't see any compelling reasons to doubt that suffering is bad, but I do see some compelling reasons to doubt that happiness is good.
  • That's just an intuition, no? (i.e. that everyone painlessly dying would be bad.) I don't really understand why you want to call it an "axiom" that happiness is intrinsically good, as if this is stronger than an intuition, which seemed to be the
... (read more)
6
JackM
2y
What is your compelling reason to doubt happiness is good? Is it thought experiments such as the ones Magnus has put forward? I think these argue that alleviating suffering is more pressing than creating happiness, but I don't think these argue that creating happiness isn't good.

I do happen to think suffering is bad, but here is a potentially reasonable counterargument - some people think that suffering is what makes life meaningful. For example, some think the idea of drugs being widespread, alleviating everyone of all pain all the time, is monstrous. People's children would get killed and the parents just wouldn't feel any negative emotion - this seems a bit wrong...

You could try to use your Pareto improvement argument here, i.e. that it's better if parents still have a preference for their child not to have been killed, but also not to feel any sort of pain related to it. Firstly, I do think many people would want there to be some pain in this situation and that they would think of a lack of pain as disrespectful and grotesque. Otherwise, I'm slightly confused about one having a preference that the child wasn't killed, but also not feeling any sort of hedonic pain about it... is this contradictory?

As I said, I do think suffering is bad, but I'm yet to be convinced this is less of a leap of faith than saying happiness is good.

For all practical purposes suffering is dispreferred by beings who experience it, as you know, so I don't find this to be a counterexample. When you say you don't want someone to make you less sad about the problems in the world, it seems like a Pareto improvement would be to relieve your sadness without changing your motivation to solve those problems—if you agree, it seems you should agree the sadness itself is intrinsically bad.

2
MichaelStJules
2y
I don't have settled views on whether or not suffering is necessarily bad in itself. That someone (or almost everyone) disprefers suffering doesn't mean suffering is bad in itself. Even if people always disprefer less pleasure, it wouldn't follow that the absence of pleasure is bad in itself. Even those with symmetric views wouldn't say so; they'd say its absence is neutral and its presence is good and better. We wouldn't say dispreferring suffering makes the absence of suffering an intrinsic good.

I'm sympathetic to a more general "relative-only" view according to which suffering is an evaluative impression against the state someone is in relative to an "empty" state or nonexistence, so a kind of self-undermining evaluation. Maybe this is close enough to intrinsic badness and can be treated like intrinsic badness, but it doesn't seem to actually be intrinsic badness.

I think Frick's approach, Bader's approach and Actualism, each applied to preferences that are "relative only" rather than whole lives, could still imply that worlds with less suffering are better and some lives with suffering are better not started, all else equal, while no lives are better started, all else equal. This is compatible with the reason we suffer sometimes being because of mere relative evaluations between states of the world without being "against" the current state or things being worse than nothing.

It seems that a hedonist would need to say that removing my motivation is no harm to me personally, either (except for instrumental reasons), but that violates an interest of mine so seems wrong to me. This doesn't necessarily count against suffering being bad in itself or respond to your proposed Pareto improvement; it could just count against only suffering mattering.
3
Hank_B
2y
This response is a bit weird to me because the linked post has two counter-examples and you only answered one, but I feel like the other still applies. The other thought experiment mentioned in the piece is that of a cow separated from her calf and the two bovines being distressed by this. Michael says (and I'm sympathetic) that the moral action here is to fulfill the bovines' preferences to be together, not remove their pain at separation without fulfilling that preference (e.g. through drugging the cows into bliss). Your response about Pareto improvements doesn't seem to work here, or seems less intuitive to me at least. Removing their sadness at separation while leaving their desire to be together intact isn't a clear Pareto improvement unless one already accepts that pain is what is bad. And it is precisely the imagining of a separated cow/calf duo drugged into happiness but wanting one another that makes me think maybe it isn't the pain that matters.

No, I know of no thought experiments or any arguments generally that make me doubt that suffering is bad. Do you?

3
JackM
2y
Well if you think suffering is bad and pleasure is not good then the counterintuitive (to the vast majority of people) conclusion is that we should (painlessly if possible, but probably painfully if necessary) ensure everyone gets killed off so that we never have any suffering again. It may well be true that we should ensure everyone gets killed off, but this is certainly an argument that many find compelling against the dual claim that suffering is bad and pleasure is not good.
2
MichaelStJules
2y
https://forum.effectivealtruism.org/posts/GK7Qq4kww5D8ndckR/michaelstjules-s-shortform?commentId=LZNATg5BoBT3w5AYz

On a really basic level my philosophical argument would be that suffering is bad, and pleasure is good (the most basic of ethical axioms that we have to accept to get consequentialist ethics off the ground).

It seems like you're just relying on your intuition that pleasure is intrinsically good, and calling that an axiom we have to accept. I don't think we have to accept that at all — rejecting it does have some counterintuitive consequences, I won't deny that, but so does accepting it. It's not at all obvious (and Magnus's post points to some reasons we might favor rejecting this "axiom").

2
JackM
2y
Would you say that saying suffering is bad is a similar intuition?

This is how Parfit formulated the Repugnant Conclusion, but as it's usually discussed in population ethics debates about the (de)merits of total symmetric utilitarianism, it need not be the case that the muzak and potatoes lives never suffer.

The real RC that some kinds of total views face is that world A with lives of much more happiness than suffering is worse than world Z with more lives of just barely more happiness than suffering. How repugnant this is, for some people like myself, depends on how much happiness or suffering is in those lives... (read more)
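To make that comparison concrete, here's a hedged sketch with made-up numbers (the totals are mine, purely for illustration): a total symmetric view sums net welfare across lives, and the lives in world Z can contain real suffering so long as their happiness barely exceeds it.

```latex
% World A: 10^6 lives, each with happiness 100 and suffering 1   (net +99)
%   Total_A = 10^6 * 99       = 9.9 x 10^7
% World Z: 10^{11} lives, each with happiness 100 and suffering 99.999   (net +0.001)
%   Total_Z = 10^{11} * 0.001 = 10^8
\[
  \underbrace{10^{11} \times 0.001}_{\text{Total}_Z}
  \;>\;
  \underbrace{10^{6} \times 99}_{\text{Total}_A},
\]
% so the total view ranks Z above A, even though every life in Z contains
% substantial suffering that is only barely outweighed by its happiness.
```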

which goes against the belief in a net-positive future upon which longtermism is predicated

Longtermism per se isn't predicated on that belief at all—if the future is net-negative, it's still (overwhelmingly) important to make future lives less bad.

But I want to be clear that this normative disagreement isn't evidence of any philosophical defect on our part.

Oh I absolutely agree with this. My objections to that quote have no bearing on how legitimate your view is, and I never claimed as much. What I find objectionable is that by using such dismissive language about the view you disagree with, not merely critical language, you're causing harm to population ethics discourse. Ideally readers will form their views on this topic based on their merits and intuitions, not based on claims that views are "too... (read more)

It seems like you're conflating the following two views:

  1. Utilitarianism.net has an obligation to present views other than total symmetric utilitarianism in a sympathetic light.
  2. Utilitarianism.net has an obligation not to present views other than total symmetric utilitarianism in an uncharitable and dismissive light.

I would claim #2, not #1, and presumably so would Michael. The quote about nihilism etc. is objectionable because it's not just unsympathetic to such views, it's condescending. Clearly many people who have reflected carefully about ethics thi... (read more)

-2
Richard Y Chappell
2y
It seems to me that you're conflating process and substance. Philosophical charity is a process virtue, and one that I believe our article exemplifies. (Again, the exploration of value blur offers a charitable development of the view in question.) You just don't like that our substantive verdict on the view is very negative. And that's fine, you don't have to like it. But I want to be clear that this normative disagreement isn't evidence of any philosophical defect on our part. (And I should flag that Michael's process objections, e.g. complaining that we didn't preface every normative claim with the tedious disclaimer "in our opinion", reveal a lack of familiarity with standard norms for writing academic philosophy.)

This sociological claim isn't philosophically relevant. There's nothing inherently objectionable about concluding that some people have been mistaken in their belief that a certain view is worth taking seriously. There's also nothing inherently objectionable about making claims that are controversial. (Every interesting philosophical claim is controversial.)

What you're implicitly demanding is that we refrain from doing philosophy (which involves taking positions, including ones that others might dislike or find controversial), and instead merely report on others' arguments and opinions in a NPOV fashion. That's a fine norm for wikipedia, but I don't think it's a reasonable demand to make of all philosophers in all places, and IMO it would make utilitarianism.net worse (and something I, personally, would be much less interested in creating and contributing to) if we were to try to implement it there.

As a process matter, I'm all in favour of letting a thousand flowers bloom. If you don't like our philosophical POV, feel free to make your own resource that presents things from a POV you find more congenial! And certainly if we're making philosophical errors, or overlooking important counterarguments, I'm happy to have any of that drawn to
One is that views of the "making people happy" variety basically always wind up facing structural weirdness when you formalize them. It was my impression until recently that all of these views imply intransitive preferences (i.e something like A>B>C>A), until I had a discussion with Michael St Jules in which he pointed out more recent work that instead denies the independence of irrelevant alternatives.

It depends if by valuing "making people happy" one means 1) intrinsically valuing adding happiness to existing people's lives, or 2) valuing "m... (read more)

Second, I might be mistaken about what this agent’s choice would be. For instance, perhaps the lake is so cold that the pain of jumping in is of greater moral importance than any happiness I obtain.

Yeah, I think this is pretty plausible at least for sufficiently horrible forms of suffering (and probably all forms, upon reflection on how bad the alternative moral views are IMO). I doubt my common sense intuitions about bundles of happiness and suffering can properly empathize, in my state of current comfort, with the suffering-moments.

But given you said t... (read more)

5
Aaron Bergman
2y
Yeah...I think I just forgot to add that? Although it seems the least (empirically) likely of the four possibilities
There is a defense of ideas related to your position here

For the record I also don't find that post compelling, and I'm not sure how related it is to my point. I think you can coherently think that the moral truth is consistent (and that ethics is likely to not be consistent if there is no moral truth), but be uncertain about it. Analogously I'm pretty uncertain what the correct decision theory is, and think that whatever that decision theory is, it would have to be self-consistent.

2
abrahamrowe
2y
Yeah that makes sense to me. To be clear, the fact that two smart people have told me that they disagree with my sense that moral realism pushes against consistency seems like good evidence that my intuitions shouldn't be taken too strongly here.
I also would be interested in seeing someone compare the tradeoffs on non-person-affecting views vs person-affecting ones. E.g. person-affecting views might entail X weirdness, but maybe X weirdness is better to accept than the repugnant conclusion, etc.

Agreed—while I expect people's intuitions on which is "better" to differ, a comprehensive accounting of which bullets different views have to bite would be a really handy resource. By "comprehensive" I don't mean literally every possible thought experiment, of course, but something that gives a sense of the significant consi... (read more)

Also, moral realism seems more predictive of ethics being consistent, not less. (Not consistent with our unreflected intuitions, though.)

2
abrahamrowe
2y
My spouse shared this view when reading a draft of this post, which I found interesting because my intuitions went somewhat strongly the other way. I don't really have strong views here, but it seems like there are three possible scenarios for realists:

  • Morality follows consistent rules and behaves according to a logic we currently use
  • Morality follows consistent rules but doesn't behave according to a logic we currently use
  • Morality doesn't follow consistent rules

And in 2/3 of those, this problem might exist, so I leaned toward saying that this was an issue for realists. There is a defense of ideas related to your position here, though I didn't find it particularly compelling personally.

I'm confused — welfare economics seems premised on the view that interpersonal comparisons of utility are possible. In any case, ethics =/= economics; comparisons of charity effectiveness aren't assessing interpersonal "utility" in the sense of VNM preferences, they're concerned with "utility" in the sense of e.g. hedonic states, life satisfaction, so-called objective lists, and so on.

1
Barracuda
2y
To quote John C. Harsanyi: 'from most branches of economics the concept of cardinal utility has been eliminated as redundant since ordinal utility has been found to suffice for doing the job. Cardinal utility has been kept only in welfare economics to support the demand for a more equal income distribution.' I would recommend further reading on the ordinal revolution that followed the marginal revolution. The reason for restricting mainstream economics to ordinal rather than cardinal utility was not arbitrary, and the effectiveness of economics compared to ethics should be considered if one is to be chosen over the other in the context of effective altruism.

In any case, I see how frosty a reception my ideas have had on here, not just in this post or comments, and don't feel this is fertile ground for new ideas outside the echo chamber. I don't expect to return here, but I think I do get an email if someone private messages me, so if anyone wants to reach me, feel free to personal message me - thanks muchly

No, longtermism is not redundant

I’m not keen on the recent trend of arguments that persuading people of longtermism is unnecessary, or even counterproductive, for encouraging them to work on certain cause areas (e.g., here, here). This is for a few reasons:

  • It’s not enough to believe that extinction risks within our lifetimes are high, and that extinction would constitute a significant moral problem purely on the grounds of harms to existing beings. Arguments for the tractability of reducing those risks, sufficient to outweigh the nearterm good done by focu
... (read more)

I think this is just an equivocation of "utility." Utility in the ethical sense is not identical to the "utility" of von Neumann Morgenstern utility functions.

1
Barracuda
2y
I don't think so - this is mainstream usage of the term in welfare economics.

It's notable that a pilot study (N = 172, compared to N = 474 for the results given in Fig. 1) discussed in the supplementary materials of this paper suggests a stronger suffering/happiness asymmetry in people's intuitions about creating populations. e.g. In response to the question, “Suppose you could push a button that created a new world with X people who are generally happy and 10 people who generally suffer. How high would X have to be for you to push the button?”, the median response was X = 1000.

For a mundane example, imagine I'm ambivalent about mini-golfing. But you know me, and you suspect I'll love it, so you take me mini-golfing. Afterwards, I enthusiastically agree that you were right, and I loved mini-golfing.

It seems you can accommodate this just as well, if not better, within a hedonistic view—you didn't prefer to go mini-golfing, but mini-golfing made you happier once you tried it, so that's why you endorse people encouraging you to try new things. (Although I'm inclined to say, it really depends on what you would've otherwise done with your time instead of mini-golfing, and if someone is fine not wanting something, it's reasonable to err on the side of not making them want it.)

In Defense of Aiming for the Minimum

I’m not really sympathetic to the following common sentiment: “EAs should not try to do as much good as feasible at the expense of their own well-being / the good of their close associates.”

It’s tautologically true that if trying to hyper-optimize comes at too much of a cost to the energy you can devote to your most important altruistic work, then trying to hyper-optimize is altruistically counterproductive. I acknowledge that this is the principle behind the sentiment above, and evidently some people’s effectiveness has... (read more)

For what it's worth, my experience hasn't matched this. I started becoming concerned about the prevalence of net-negative lives during a particularly happy period of my own life, and have noticed very little correlation between the strength of this concern and the quality of my life over time. There are definitely some acute periods where, if I'm especially happy or especially struggling, I have more or less of a system-1 endorsement of this view. But it's pretty hard to say how much of that is a biased extrapolation, versus just a change in the size of my empathy gap from others' suffering.

But only some s-risks are very concerning to utilitarians -- for example, utilitarians don't worry much about the s-risk of 10^30 suffering people in a universe with 10^40 flourishing people.

Utilitarianism =/= classical utilitarianism. I'm a utilitarian who would think that outcome is extremely awful. It depends on the axiology.

Longtermism, as a worldview, does not  want present day people to suffer; instead, it wants to  work towards a future with as little suffering as possible, for everyone.

This is a bit misleading. Some longtermists, myself included, prioritize minimizing suffering in the future. But this is definitely not a consensus among longtermists, and many popular longtermist interventions will probably increase future suffering (by increasing future sentient life, including mostly-happy lives, in general).

I think the strength of these considerations depends on what sort of longtermist intervention you're comparing to, depending on your ethics. I do find the abject suffering of so many animals a compelling counter to prioritizing creating an intergalactic utopia (if the counterfactual is just that fewer sentient beings exist in the future). But some longtermist interventions are about reducing far greater scales of suffering, by beings who don't matter any less than today's animals. So when comparing to those interventions, while of course I feel really horr... (read more)

Longtermism is probably not really worth it if the far future contains much more suffering than happiness

Longtermism isn't synonymous with making sure more sentient beings exist in the far future. That's one subset, which is popular in EA, but an important alternative is that you could work to reduce the suffering of beings in the far future.

2
Corentin Biteau
2y
Oh yeah, I just remembered that moral circle expansion was part of longtermism, that's true. It's just that I mostly hear about longtermism when it comes to existential risk reduction - my point above was more about that topic.

Thanks for the kind feedback. :) I appreciated your post as well—I worry that many longtermists are too complacent about the inevitability of the end of animal farming (or its analogues for digital minds).

Ambitious value learning and CEV are not a particularly large share of what AGI safety researchers are working on on a day-to-day basis, AFAICT. And insofar as researchers are thinking about those things, a lot of that work is trying to figure out whether those things are good ideas the first place, e.g. whether they would lead to religious hell.

Sure, but people are still researching narrow alignment/corrigibility as a prerequisite for ambitious value learning/CEV. If you buy the argument that safety with respect to s-risks is non-monotonic in proximity ... (read more)

I'm pretty happy to bite that bullet, especially since I'm not an egoist. I should still leave my house because others are going to suffer far worse (in expectation) if I don't do something to help, at some risk to myself. It does seem strange to say that if I didn't have any altruistic obligations then I shouldn't take very small risks of horrible experiences. But I have the stronger intuition that those horrible experiences are horrible in a way that the nonexistence of nice experiences isn't. And that "I" don't get to override the preference to avoid such experiences, when the counterfactual is that the preferences for the nice experiences just don't exist in the first place.

Note that there are normative views other than discounting and person-affecting views that do not prioritize reducing existential risks—at least extinction risks specifically, which seem to be the large majority of existential risks that the EA community focuses on. I discuss these here.
