Law-Following AI 1: Sequence Introduction and Structure

Cullen 🔸

Law-Following AI 1: Sequence Introduction and Structure

Comments 2

Sorted by

New & upvoted

Hi Cullen, this is a fantastic sequence. Have you considered publishing it as a paper on Arxiv or the like, so that it can be more readily cited in other work?

I recently wrote a summary of work on “Legal AI”, arguing that it is an important research direction for alignment but noting that “there exists no thorough overview of legal AI from a longtermist perspective.”Clearly this was incorrect, you’ve written it. I had not yet come across your sequence, in part because it was not cited as a motivation for any of the papers I’d read.

I’m planning to follow up within the next few weeks with more concrete questions about potential research directions on this topic, but just wanted to leave that brief note and say kudos on the work.

Cullen 🔸

Thanks Aidan! I do regret this not being as discoverable as optimal, but I'm kinda publishing this as I'm writing it. Maybe I'll do a white paper as I refine the central points.

We should chat about this sometime. I also didn't see your discussion of law; seems like we have untapped shared interests!

Comments

More from the author

303

What Makes Outreach to Progressives Hard

Cullen 🔸·5y ago·11m read

107

Polio Lab Leak Caught with Wastewater Sampling

Cullen 🔸·3y ago·1m read

108

Should EA Buy Distribution Rights for Foundational Books?

Cullen 🔸·6y ago·2m read

Curated and popular this week

Cultivating hope: calibrating the expectations for cultivated meat to end factory farming

PabloAMC 🔸·1w ago·Curated 1d ago·22m read

GWWC's 2025 impact evaluation (executive summary)

Aidan Whitfield🔸, Giving What We Can🔸·3d ago·2m read

This post presents the executive summary from Giving What We Can’s impact evaluation for 2025. At the end of this post we share links to more information, including the full report and...

Maybe do the thing you wish CEA would do

alejoacelas 🔸·1d ago·2m read

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague sense, CEA spinouts: * Kairos is directly a spinout of CEA and now handles most support for university AI safety groups. Basically everyone I've found who knows them is really excited about what they do * NEST is an opinionated ideas-fi...

Recent opportunities to take action

RP is looking for project founders in neglected animal areas

Rethink Priorities·1d ago·7m read

Announcing the Safe Pareto Improvements (SPI) Fundamentals Program

Center on Long-Term Risk, Anthony DiGiovanni 🔸, Santeri T 🔹·17h ago·3m read

Effective petitions (July 2026)

Stijn Bruers 🔸·15h ago·1m read

For early, informal discussion on this topic, see Michael St. Jules, What are the challenges and problems with programming law-breaking constraints into AGI?, Effective Altruism Forum (Feb. 2, 2020), https://forum.effectivealtruism.org/posts/qKXLpe7FNCdok3uvY/what-are-the-challenges-and-problems-with-programming-law [https://perma.cc/HJ4Y-XSSE] and accompanying comments. ↩︎
Whether such rules are actually encoded into legislation is not particularly important. Virtually all legal rules not part of public law can be made “legal” with regards to particular parties as part of a contract, for example. In any case, the heart of LFAI is being bound to follow rules, and interpreting those rules leveraging the rich body of useful rule-interpretation metarules from law. ↩︎
This is important because one of the core functions of law is to provide metarules regarding the interpretation of rules, guided by certain normative values (e.g., fairness, predictability, consistency). Indeed, rules of legal interpretation aim to solve many problems relevant to AI interpretation of instructions. Cf. Dylan Hadfield-Menell & Gillian Hadfield, Incomplete Contracting and AI Alignment (2018) (preprint), https://arxiv.org/abs/1804.04268. ↩︎
That is, the AI is not law-following just because the principal wants the AI to follow the law. Indeed, LFAI should disobey orders that would require it to behave illegally. ↩︎
That is, the AI is not law-following just because it is instrumentally valuable to it (because, e.g., being caught breaking the law would cause the AI to be turned off). ↩︎
As Ngo says, "My opinion is that defining alignment in maximalist terms is unhelpful, because it bundles together technical, ethical and political problems. While it may be the case that we need to make progress on all of these, assumptions about the latter two can significantly reduce clarity about technical issues." ↩︎
Cf., e.g., Dario Amodei et al., Concrete Problems in AI Safety 4 (2016), https://arxiv.org/pdf/1606.06565.pdf. ↩︎
I don't here offer an opinion on what training regime would yield such an outcome—my hope is to get someone to answer that for me! ↩︎
This approach may work particularly well when combined with insurance requirements for people deploying AI systems. ↩︎
In the same way that an intent-aligned AI will sometimes ask for clarifications from a human principal. See Christiano. ↩︎
Note that there are ELK-style problems with this approach. If an AI is asking for legal advice and wants to minimize the negative signal it gets from the Counselor, it may hide certain relevant information (e.g., its true state of knowledge or its true intentions) from the Counselor. A good solution, as discussed, could be to simulate an idealized adjudication of the issue if all the parties knew all the relevant facts and had equal legal firepower. But incentivizing the LFAI to tell the Counselor its true knowledge/intentions is an ELK problem. In the limit, the Counselor need not strictly be a distinct agent from the LFAI: an LFAI system may have Counselor capabilities and run this "consultation" process internally. Nevertheless, it is illustratively useful to imagine a separation of the LFAI and the Counselor. ↩︎
This would be idealized so that details not ultimately relevant to the substantive legality of the action (e.g., jurisdiction, AI personhood, other procedural matters, asymmetries in legal firepower) can be ignored. See the final footnote of this piece for further discussion. ↩︎
See the Appendix for more discussion on this point. ↩︎
See Battery, Wex , https://www.law.cornell.edu/wex/battery (last accessed Sept. 3, 2021). ↩︎
See, e.g., Intel Corp. v. Hamidi, 71 P.3d 296, 304–08 (Cal. 2003) (applying trespass to chattels to unauthorized electronic computer access); MAI Sys. Corp. v. Peak Computer, Inc., 991 F.2d 511, 518–19 (9th Cir. 1993) (storing data in RAM sufficient to create a "copy" for copyright purposes, despite the fact that a "copy" must be "fixed in a tangible medium"); cf. United States v. Jones, 565 U.S. 400, 406 n.3 (2012) (analogizing GPS tracking to in-person surveillance for Fourth Amendment purposes). ↩︎
See, e.g., Jonathan H. Blavin & I. Glenn Cohen, Gore, Gibson, and Goldsmith: The Evolution of Internet Metaphors in Law and Commentary, 16 Harv. J.L. & Tech. 265 (2002). ↩︎
However, the case for working on LFAI certainly diminishes with the number of applicable laws. ↩︎
This raises further issues, including the possibility of self-reference. For example, an LFAI or Counselor asymmetrically deployed by one litigant may be able to persuade a judge or jury of its position, even if it's not the best outcome. To avoid this, such simulations should assume that judges and juries are fully apprised of all relevant facts (i.e., neither the LFAI nor Counselor can obscure relevant evidence) and if deployed in the simulated proceeding are symmetrically available to both sides. ↩︎

Law-Following AI 1: Sequence Introduction and Structure

Law-Following AI 1: Sequence Introduction and Structure

Key Definitions

A Sketch of LFAI

Appendix: More Conceptual Clarifications on LFAI

Applicability of Law to AI Systems

Predicting Legality