Moloch's Toolbox (1/2)

EliezerYudkowsky

Comments 10

Sorted by

New & upvoted

The p-value critique doesn't apply to many scientific fields. As far as I can tell, it mostly applies to social science and maybe epidemiological research. In basic biological research, a paper wouldn't be published in a good journal on the basis of a single p-value. In fact, many papers don't have any p-values. When p-values are presented, they're often so low (10^-15) that they're unnecessary confirmations of a clearly visible effect. (Silly, in my opinion.) Most papers rely on many experiments, which ideally provide multiple lines of evidence. It's also common to propose a mechanism that's plausible given the existing literature. In some cases, you can see the fingerprints of skeptical reviewers. For example, when I see "to exclude the possibility that", I assume that this experiment was added later at the demand of a reviewer. Published biology is often wrong, but for subtler reasons.

CarlShulman

"The p-value critique doesn't apply to many scientific fields." I agree with this, or at least that it is vastly weaker when overwhelming data are available to pin down results.

"As far as I can tell, it mostly applies to social science and maybe epidemiological research. "

I disagree with this.

For instance, p-value issues have been catastrophic in quantitative genetics. The vast bulk of candidate gene research in genetics was non-replicable p-hacking of radically underpowered studies. E.g. schizophrenia candidate genes replicate at chance levels in massive replications but had literatures of p-hacked and publication bias artifact studies. The field moved to requiring genome-wide significance of 5*10^-8 (i.e. Bonferroni corrections for multiple testing at all measured variants). Results obtained in huge genome-wide association studies that meet that criterion replicate reliably.

ETA: It isn't basic biological research, but medical and drug trials routinely have severe p-hacking issues. And there have been a lot of reproducibility problems reported with, e.g. preclinical cancer research, often lacking slam dunk evidence. The Reproducibility Project: Cancer is working on that.

Medical studies take up the bulk of biomedical research funds, and Eliezer's example is at the intersection of medicine and nutrition.

ETA2: I don't think issues of p-hacking would be solved just by using Bayesian statistics: people can instead selectively report Bayes factors, i.e. posterior hacking. It's the selective use of analytic and reporting degrees of freedom that's central. Here's Daryl Bem and coauthors' Bayesian meta-analysis purporting to show psi in Bem's p-hacked experiments.

surfergirl

medical and drug trials routinely have severe p-hacking issues. And there have been a lot of reproducibility problems reported with, e.g. preclinical cancer research, often lacking slam dunk evidence.

Due to my medical problems I have been reading medical literature for 25 years, and indeed it is a catastrophe of p-hacking and the like, incompetent statistical analysis, ven very often there is a basic misunderstanding of what p-values mean. You routinely see researchers claiming "no effect" when the p value is slightly over 0.05.

Usually, medical papers are misleading in some serious way. The best you can hope for is that they waste the vast majority of the value in the data.

People who read abstracts only and thing they are learning something are deluding themselves. You can to go through the methods section carefully and even then not all the shenanigans are disclosed, and look very closely at sponsorship of the parties to the study (researchers, journal editors, institutions etc) to pick up the extreme biases that result from sponsorship.

Lila

I consider GWAS applied, not basic, because it's not mechanistic. Most biologists I've spoken to have a fairly poor opinion of GWAS, as do I. Much of the biological research that gets funded is basic.

Aaron Gertler 🔸

One Molochian factor that was briefly mentioned in the dead-baby example: The people most skilled at generating outrage, at least until good-aligned organizations get good at training people to generate outrage, will typically generate outrage about more-or-less random topics that happen to affect them.

See, for example, the one-man campaign by a heart surgeon, whose wife died due to very rare complications, to reduce the odds of those rare complications ever happening -- and getting unusually rapid support from the FDA, because he made a Change.org petition and writes in a style that is accessible, yet sufficiently medical-sounding, to draw attention from many different groups.

(I'm no medical expert, but the surgeon's suggestions are controversial, and many doctors seem to think they'll cause more harm than good by squeezing out the good uses of the procedure which caused the complications.)

https://www.change.org/p/women-s-health-alert-deadly-cancers-of-the-uterus-spread-by-gynecologists-stop-morcellating-the-uterus-in-minimally-invasive-and-robot-assisted-hysterectomy

If this person had been the father of a child who died of parenteral nutrition-associated liver disease, the FDA might well have acted on that issue instead. But it's hard to point people like this in the "right direction".

Denkenberger🔸

As for the value of college for non-doctors, what about the study of GI bill recipients that were randomly chosen that found that college did have significant causal benefits (it was not just correlation that colleges were just choosing better qualified people)?

RobBensinger

I'm not an expert in this area and haven't seen that study, but I believe Eliezer generally defers to Bryan Caplan's analysis on this topic. Caplan's view, discussed in The Case Against Education (which is scheduled to come out in two months), is that something like 80% of the time students spend in school is signaling, and something like 80% of the financial reward students enjoy from school is due to signaling. So the claim isn't that school does nothing to build human capital, just that a very large chunk of schooling is destroying value.

Denkenberger🔸

Wow - is there a paper to this effect? I would be surprised if it is that high for the technical fields.

Ben Pace

I haven't read Caplan's book, but I can imagine >50% of the math learned in a math course being not used in a technical career outside of research, and furthermore that the heuristics picked up in those courses are not generalisable (e.g. geometry heuristics not applying to differential equations).

kbog

I love the suggestions about society that are being passed through the visitor. They make so much sense.

Comments

More from the author

466

IMPCO, don't injure yourself by returning FTXFF money for services you already provided

EliezerYudkowsky·3y ago·10m read

Contradict my take on OpenPhil's past AI beliefs

EliezerYudkowsky·6mo ago·4m read

171

Who's at fault for FTX's wrongdoing

EliezerYudkowsky·3y ago·8m read

Curated and popular this week

What would an animal-aligned AI be aligned to?

Aidan Kankyoku, Anima International·1w ago·Curated 4d ago·15m read

This is a crosspost from the new Animal Welfare Alignment Newsletter by Anima International. You can subscribe on Substack if you are interested in following these efforts. Audio reading also available on Substack. The goals of this post are to: 1. Raise a question I see as crucially important to the goal of aligning AI to animal welfare...

191

The first video from Giving What We Can's new channel is out now!

JustinPortela·6d ago·1m read

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about everything in the EA universe that isn't AI. ...

108

Let's taboo the V-word

lincolnq·1d ago·8m read

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It’s a baseline assumption, and it mostly holds true: if you’re out advocating for animals not to be tortured or abused, realistically these days you are v**n, or close. And it makes for good conversation. It seems fairly safe to assume when you meet strangers. But this assumption is hurting the movement in a way which we don’t always notice: someone new comes into the sp...

Recent opportunities to take action

EA Organisation Updates thread: July 2026

Dane Valerie·15h ago·1m read

The EA Opportunities Board now has full-time roles

Agnes Hasselblad 🔸·17h ago·3m read

fatika·2h ago·1m read

CarlShulman

"The p-value critique doesn't apply to many scientific fields." I agree with this, or at least that it is vastly weaker when overwhelming data are available to pin down results.

"As far as I can tell, it mostly applies to social science and maybe epidemiological research. "

I disagree with this.

Medical studies take up the bulk of biomedical research funds, and Eliezer's example is at the intersection of medicine and nutrition.

Moloch's Toolbox (1/2)

i. For want of docosahexaenoic acids, a baby was lost

ii. Asymmetric information and lemons problems

iii. Academic incentives and beneficiaries

iv. Two-factor markets and signaling equilibria

v. Total market failures

vi. Absence of (meta-)competition