How to do theoretical research, a personal perspective

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·5d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Recent opportunities to take action

RP is looking for project founders in neglected animal areas

Rethink Priorities·3d ago·7m read

158

Possible mistake EAs are making and shout out to Pause AI UK

Michelle_Hutchinson·1w ago·4m read

Announcing the Safe Pareto Improvements (SPI) Fundamentals Program

Center on Long-Term Risk, Anthony DiGiovanni 🔸, Santeri T 🔹·2d ago·3m read

^{^}

i.e., ease of understanding arguments, including what is or isn't being assumed.

^{^}

For example, it might be harder to detect that a writer is strawmanning their intellectual opponents or such mischaracterizations might even be welcomed by some readers, whereas fraudulent data in STEM is egregious. (Part of the issue is that people might have more plausible deniability for strawmanning: it's harder to determine that they were acting in bad faith, and/or it might even be hard to determine that they were in fact misrepresenting others' views.)

Marcel2

I'm a bit confused by your distinction: the question "Did […]

If you can't find reliable data, that just makes it hard, not theoretical.

The use of “did” vs “would” wasn’t very intentional or precise.

As to the empirical vs. theoretical nature of my hypothesis, it is indeed claiming that certain relationships empirically existed (and, with a lot of caveats, may continue into the future). However, my point was that the research methods I used were much more “theoretical”: I couldn’t do a large-N empirical analysis or controlled experiments to even establish meaningfully-controlled correlation (let alone causation) between the dependent, control, and independent variables, and instead had to rely on lines of reasoning such as:

Hypothetical scenarios (e.g., imagine comparing an ambush where both parties have machine guns vs. one where neither side has machine guns)—which is impractical to clinically/experimentally test (I.e., with high reality fidelity)
More-qualitative (and somewhat subjective) comparison of case studies, using a large amount of argumentation/theoretical reasoning to deal with the many gaps and flaws in the case comparison (given that, as I noted, there didn’t seem to be any good case comparison pair in the historical record)
Agreement with existing theoretical and/or empirical concepts in the literature, such as Biddle’s Modern System.

it basically just seems to say "think about the problem until you can figure out how to test it with traditional empirical methods."

Well, yeah, what else would you expect? The post describes how you might use argument clashes and oversimplified simulations in thinking about the problem.

Again, perhaps I was being a bit too imprecise with my language? My point is that for some questions (arguably including my thesis), theoretical argumentation has to bear a lot of the analytical burden. This analytical burden can include things like:

Explaining why variables Q, K, and W—none of which you could experimentally control for—probably do or don’t affect the relationship;
Explaining why your very limited sample size can probably be extrapolated to some other cases;
Explaining why some metric is probably a decent proxy for what you actually are trying to measure;
Reasoning about hypothetical scenarios which will not actually empirically occur.

(Caveat: all of those activities can be supported by direct reference to supporting data in some situations, but not always.)

In contrast, it seems that much of the “theoretical” research methods described in this post are basically just “use lots of thinking to figure out how to test this empirically against data [at which point these empirical methods do almost all the legwork.]”

There is perhaps some debate to be had over the meaning of “theoretical” research methods: do mathematical proofs or algorithms count as theory? While I’m not universally opposed to using the term in such a context, I think it is much less helpful to use the term “theory” when you’re trying to juxtapose it with empirical methods. This especially feels true if a major reason you support a mathematical proof or algorithm is based on your determination that “this empirically works every single time.” When teaching research methods, I think it’s important to emphasize the differences that I described previously (e.g., legibility/transparency, reliability/consistency, reputation stake) which, in my view, have tended to make empirical methods so much more effective when they can be used.

How to do theoretical research, a personal perspective

How to do theoretical research, a personal perspective

How to do research

Figuring out what you want to happen in real-world cases

ELK Examples

Potshot algorithms

Translating what you want in real-world cases into desiderata for simple cases

ELK Example

Articulating an algorithm for solving simple cases

ELK Examples

Finding cases where your algorithm doesn’t do what you want

ELK Examples

Other random tips