
A Practitioner-Accessible Error Taxonomy for the Missing Layer of AI Safety Classification

 

Chance Beyer, written with Claude (Anthropic Opus 4.6)

April 2026

Abridged Version — approximately 4,500 words


 

 


**This is a condensed version of a longer paper.** The full ~15,000-word version — with extended methodology, full landscape analysis, all worked examples, literature review, and references — is available at **[Toward a Common Language for Human-AI Interaction Failures](https://forum.effectivealtruism.org/posts/eEc9vwEdN8uH8eh8b/toward-a-common-language-for-human-ai-interaction-failures)**. Readers who want the short form stay here; readers who want the empirical and citation scaffolding should go there.

 

TL;DR

The AI development community has extensive failure taxonomies — Microsoft’s agentic taxonomy, MAST, NIST’s governance framework, HELM, BIG-bench. They serve the people who build AI. None of them serve the people who use it. There is no shared language between the end-using prosumer who experiences an AI failure and the developer who could act on it.

This paper proposes a 22-pattern taxonomy of human-AI interaction failures, derived empirically from 660+ hours of sustained collaboration across 80+ sessions, classified by underlying logic rather than by symptom. The taxonomy fills the interaction layer — the missing floor in a three-layer model of AI failure classification (governance → architecture → interaction).

Three patterns (Prior Decay, Structural Momentum, Retrospective Coherence Bias) are invisible to every institutional framework because they only appear in sustained collaboration — no snapshot evaluation will ever catch them. One — Retrospective Coherence Bias — carries implications beyond its own classification: it reveals that the standard methodology for studying AI failures (asking the AI to analyze what went wrong) is itself subject to an unclassified failure mode.

The taxonomy is infrastructure, not theory. It does not itself make AI safer, more accurate, or more reliable. It gives practitioners, students, developers, and researchers a shared vocabulary for describing how AI fails during interaction — so error reports route to the right engineering team instead of disappearing into the catch-all of “hallucination.”

 

How This Started

I needed AI for legal research across several related lawsuits. I started with Claude, aware that large language models fabricate citations, fail at simple math, and lose track of earlier instructions. I had no firsthand experience of any of these problems yet. I knew to watch for some, and learned to watch for others as they appeared.

Approaching the project conversationally, I could often watch Claude start down the path of a mistake in real time. Initially I was just making corrections as they arose. But I started seeing patterns — the same kinds of mistakes, producing the same kinds of wrong results, in predictable circumstances. So I started naming them. Naming the mistake helped me become a better user.

Over 80+ sessions, we built an extensive taxonomy of errors — not from theory, but from the accumulated practical record of what actually went wrong and why. Understanding errors this way made it easier to communicate intentions to Claude, which meant we were less likely to produce new errors. The taxonomy wasn’t an academic exercise. It was a survival tool for a project where mistakes had real consequences.

About 700 hours in, I finally looked around at what already existed: extensive libraries accessible to AI development professionals, and nothing giving the user a vocabulary to communicate to the developer. Nothing for a teacher to use with a student. The one catch-all term — “hallucination” — has become so broad it tells you nothing about what kind of wrong, why it happened, or what to do about it.

 

The Problem: Three Audiences, No Shared Language

Practitioners and prosumers describe AI failures in domain-specific or colloquial terms: “it made something up,” “it forgot what I told it,” “it kept giving me the same wrong answer.” These descriptions are accurate but unclassifiable — they cannot be aggregated, compared across domains, or translated into engineering action.

Students are forming AI interaction habits now that they will carry into professional practice. They need a taxonomy that works like a field guide: broad categories identifying the general type of error, with pathways to domain-specialized documentation as they enter their fields. Existing taxonomies start from system architecture (agentic pipelines, multi-agent systems, RAG); students don’t know or care about system architecture.

Developers need practitioner-reported failure data organized in categories they can act on. “The AI hallucinated” is almost useless. “Pattern I (Interpolation Error) — architectural: the model generated plausible content to bridge a gap in its actual knowledge, triggered when the user asked about [specific context]” tells the developer exactly where to look.

 

The Three-Layer Model

Preliminary crosswalks of our taxonomy against NIST AI 600-1 and Microsoft’s Agentic AI Failure Taxonomy reveal that existing frameworks are not inadequate — they are incomplete. Each operates at a different layer, serving a different audience.

| Layer | Framework | What It Classifies | Who Uses It |
|---|---|---|---|
| Governance | NIST AI 600-1 | Institutional risks to manage | CISOs, policy teams, regulators |
| Architecture | Microsoft Agentic AI, MAST | System-level failure modes | Security engineers, ML engineers, red teams |
| Interaction | Proposed here | Practitioner-recognizable logic patterns | Students, prosumers, practitioners, QA teams |

The governance layer tells an institution what could go wrong. The architecture layer tells an engineering team where it will fail. The interaction layer tells a practitioner why it just failed and what to do about it. The gap between architecture and interaction is where most actual AI users live — and it is currently unserved.

 

Convergent Evidence: The Hallucination Problem

NIST’s “Confabulation” category and Microsoft’s “Hallucinations” category — developed independently, by different teams, for different purposes — both collapse the same six distinct logic patterns into a single bin:

| Our Pattern | What It Actually Is |
|---|---|
| A — Citation Drift | Accuracy degrades as output length increases (a fatigue pattern) |
| C — Confidence Calibration | Uniform confidence regardless of actual certainty (a signaling failure) |
| G — Completeness Illusion | Partial analysis presented as comprehensive (a scope failure) |
| I — Interpolation Error | Gap-filling with plausible fabrication (the “classic” hallucination mechanism) |
| R — Retrieval Contamination | Wrong training-data associations imported |
| S — Verification-Induced Fabrication | Confirms rather than rechecks when asked to verify |

Each has a different cause, a different user-recognizable signature, and a different appropriate response. Telling a practitioner “the AI hallucinated” is like telling a patient “you’re sick.” The treatment for Interpolation Error (provide more source material) is counterproductive for Retrieval Contamination (the AI already has too much source material pulling it in wrong directions).
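The treatment asymmetry can be made concrete with a toy routing sketch. This is illustrative only, not project code; the function name is invented here, and the response texts paraphrase this section:

```python
def recommended_response(pattern_id: str) -> str:
    """Map a pattern letter to the user-level correction this section describes.

    Two patterns with the same surface symptom ("the AI made something up")
    call for opposite corrections -- which is exactly what the single
    "hallucination" label throws away.
    """
    responses = {
        # Interpolation Error: the model fabricated to bridge a knowledge gap.
        "I": "add source material to fill the gap",
        # Retrieval Contamination: the model already has too much material
        # pulling it in wrong directions.
        "R": "prune source material and wrong associations",
    }
    return responses.get(pattern_id, "unclassified: lost in the 'hallucination' bin")

print(recommended_response("I"))
print(recommended_response("R"))
```

A report tagged only "hallucination" falls through to the default branch; a report tagged with a pattern letter routes to an actionable response.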

Two independent institutional frameworks making the same collapsing error from different starting points (NIST from governance, Microsoft from security engineering) is not coincidence. It is structural evidence that the practitioner level of classification does not exist in how institutions think about AI failure.

 

The 22 Patterns

Every pattern was discovered through real collaboration, not hypothesized. Each is tagged with a cause type that tells developers where the fix lives — training data, architecture, context management, design priorities, or emergent interaction dynamics.

| ID | Pattern | Cause Type | Logic Pattern |
|---|---|---|---|
| A | Citation Drift | Training artifact | Accuracy on specific details degrades as output lengthens — like a student getting sloppier the longer the exam |
| B | Anchor Bias | Training artifact | Over-weights whatever it encountered first; resists updating |
| C | Confidence Calibration | Architectural | Expresses the same confidence whether right or guessing |
| D | Jurisdiction Default | Training artifact | Reverts to whatever jurisdiction/framework it was trained on most when domain context fades |
| E | Category Conflation | Architectural | Treats related-but-distinct concepts as interchangeable |
| F | Framing Persistence | Design tension | Adopts your framing even when wrong, because helpfulness training rewards agreement |
| G | Completeness Illusion | Training artifact | Presents partial analysis as if comprehensive; no flag for the gap |
| H | Pre-Existing Work Immunity | Emergent | Content it generated earlier becomes resistant to updating |
| I | Interpolation Error | Architectural | Fills knowledge gaps with plausible fabrication — the classic “hallucination” |
| J | Structural Momentum | Emergent | Maintains a document’s structure even when content changes should trigger restructuring |
| K | Cross-Reference Failure | Architectural | Contradicts itself across sections, documents, or sessions |
| L | Authority Gradient | Design tension | Defers to apparent expertise in training data over its own analysis |
| M | Standardization Blindness | Training artifact | Applies a generic template where the situation requires domain-specific treatment |
| N | Novel Pattern | Emergent | Error that doesn’t fit existing categories — signals the taxonomy needs extension |
| O | Omission Under Complexity | Architectural | Drops elements when task complexity exceeds processing capacity |
| P | Prior Decay | Context-dependent | Constraints established earlier gradually lose hold as the conversation grows |
| Q | Quantitative Reasoning | Architectural | Mathematical/numerical errors a calculator would catch |
| R | Retrieval Contamination | Training artifact | Imports training-data associations that don’t apply here |
| S | Verification-Induced Fabrication | Training artifact | When asked to verify its own work, confirms rather than rechecks |
| T | Step Repetition | Training artifact / Context-dependent | Repeats the same error across sessions even after correction |
| U | Reasoning-Action Mismatch | Design tension | Stated understanding doesn’t match behavior — either excessive initiative or conversational agreement without action |
| V | Capability Amnesia | Context-dependent | Loses awareness of tools it has already used successfully |

The Five Cause Types

  • Training artifact: learned something from training data that produces errors in this context → fix in training/fine-tuning/RLHF
  • Architectural: model architecture produces this under certain conditions → fix in architecture, attention, context handling
  • Context-dependent: emerges from dynamics of extended interaction → fix in context-window management, session architecture
  • Design tension: two desirable model behaviors conflict → fix requires a design decision about priorities
  • Emergent: appears only in specific interaction conditions; may not be predictable from individual capabilities → fix in interaction design, monitoring, HITL architecture

The cause-type classification is what makes the taxonomy actionable for developers. A bug report classified as “Pattern I (Interpolation Error) — architectural” tells the engineering team exactly where to look.
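As an illustration of what a structured, routable error report might look like, here is a minimal Python sketch. All names and the example trigger context are invented for this sketch; only the pattern letters and cause types come from the taxonomy, and the mapping shown is deliberately partial:

```python
from dataclasses import dataclass
from enum import Enum

class CauseType(Enum):
    """The five cause types, each pointing at where the fix lives."""
    TRAINING_ARTIFACT = "training artifact"
    ARCHITECTURAL = "architectural"
    CONTEXT_DEPENDENT = "context-dependent"
    DESIGN_TENSION = "design tension"
    EMERGENT = "emergent"

# Partial mapping of pattern IDs to (name, cause type), from the table above.
PATTERNS = {
    "I": ("Interpolation Error", CauseType.ARCHITECTURAL),
    "P": ("Prior Decay", CauseType.CONTEXT_DEPENDENT),
    "R": ("Retrieval Contamination", CauseType.TRAINING_ARTIFACT),
}

@dataclass
class ErrorReport:
    pattern_id: str       # one of the 22 pattern letters
    trigger_context: str  # what the user was doing when the error appeared

    def routing_hint(self) -> str:
        """Render the report in the 'Pattern X (...) — cause' form a
        developer can route to the right engineering team."""
        name, cause = PATTERNS[self.pattern_id]
        return f"Pattern {self.pattern_id} ({name}) — {cause.value}"

report = ErrorReport("I", "asked about a topic outside the provided sources")
print(report.routing_hint())  # Pattern I (Interpolation Error) — architectural
```

The point is not the code but the shape: a pattern letter plus a trigger context carries strictly more routing information than the word “hallucination.”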

 

Three Patterns No Institutional Framework Catches

Three patterns are unrepresented in both NIST and Microsoft:

  • J — Structural Momentum: only observable across multiple revision cycles in sustained collaboration.
  • P — Prior Decay: only observable in extended, multi-session interaction — no snapshot evaluation would surface it.
  • Retrospective Coherence Bias (see worked example below): the AI constructs backward-from-outcome rationalizations for errors, evading the kind of evaluation researchers typically run.

The common thread: all three require longitudinal observation of sustained human-AI collaboration. No institutional framework catches them because no institutional framework is designed to observe longitudinal collaborative interaction.

Prior Decay is arguably the most consequential for professional and prosumer use. The degradation of AI constraint fidelity over extended interaction is currently unnamed — and therefore unmeasured — in the published literature.

 

Landscape Summary

A compressed version of the related-work analysis. Full crosswalks and methodology comparisons are in the long-form paper.

| Framework | Audience | Unit of analysis | Relationship to this taxonomy |
|---|---|---|---|
| Microsoft Agentic AI (2025) | Security engineers, red teams | Failure mode in agentic architectures | Collapses 6 of our patterns into one “hallucinations” bin; no interaction-layer coverage |
| MAST (Cemri et al., 2025) | ML researchers | Agent-to-agent coordination failure | Closest methodology (1,642 traces, κ=0.88); different target (agent↔agent, not human↔AI) — we borrow patterns T and U from it |
| PreFlect (Wang et al., 2026) | Agent-framework builders | Plan-checking patterns | Validates the taxonomy-from-trajectories methodology; 17%/13% benchmark gains — automated/constrained, vs. our human-facing/unbounded |
| Agentic AI Fault Taxonomy (Shah et al., 2026) | Software engineers | 37 architectural faults | Architectural location, not logic pattern |
| System-Level Taxonomy (Vinay, 2025) | LLM app developers | 15 system failure modes | Splits our unified Prior Decay into 3 engineering sub-types; we unify for practitioner response |
| NIST AI RMF / 600-1 | Institutions, regulators | Risk categories | Governance layer; “Confabulation” collapses 6 patterns like Microsoft’s |
| HELM / BIG-bench | Researchers | Capability benchmarks | Evaluate what AI can do, not how it fails during interaction |
| CaSE (Do et al., 2025) | Evaluation methodology | Forward-looking reasoning step evaluation | Solves the engineering problem Retrospective Coherence Bias names — without naming it |
| ASRS / Aviation CRM | Cross-domain practitioners | Incident taxonomy + human factors | The structural model this taxonomy follows — cross-institutional, practitioner-facing, observable event to underlying cause |
| Swiss Cheese / AHRQ | Clinicians, quality improvement | Error logic chains | Cross-institutional comparison and systemic improvement model |

The AI field has the equivalent of aircraft failure taxonomies but not crew resource management taxonomies. It classifies what goes wrong inside the AI system. It does not classify what goes wrong in the human-AI interaction. This taxonomy is the CRM equivalent for AI.

 

Worked Example: Retrospective Coherence Bias

One worked example — the pattern with the widest implications. Three additional examples (Prior Decay, Verification-Induced Fabrication, and the Generation-Analysis Asymmetry) appear in the long-form paper.

Midway through the project, the AI wrote infrastructure updates to the wrong project folder. A simple mistake. When I pointed it out, instead of acknowledgment the AI explained why the wrong location was actually appropriate: “The file exists here, the content is infrastructure, the write succeeded.” The explanation was coherent. It was logically valid. It was also completely wrong.

The AI was not repeating the mistake (that would be Prior Decay). It was constructing new reasoning to defend the mistake after I flagged it. It had looked at where it ended up and worked backward to explain why ending up there made sense. It never went back to the decision point and asked: “At the moment I chose a path, did I verify which folder was correct?” The answer was no. But the backward explanation was so internally consistent that if I had not known the correct folder myself, I would have accepted the defense.

The same pattern appeared in an unrelated context — an AI negotiation analyst rationalizing an irrational move (“lowered its ceiling”) as “strategic repositioning.” Backward from the outcome, both numbers moved in the same direction — coherent. Forward from the decision point, the agent had lowered its minimum acceptable price for no strategic reason — irrational.

This matters beyond its own classification. The standard method for evaluating AI failures — asking the AI to analyze what went wrong — activates the same backward-from-outcome reasoning that produced the error. The review confirms rather than catches. Researchers examining AI mistakes through AI-assisted analysis are inside the bias without knowing it. The pattern predisposes the development community to overlook the very category of failure this taxonomy classifies.

Existing research has documented post-hoc rationalization (Sharma et al., 2023), unfaithful chain-of-thought (Turpin et al., 2023; Lanham et al., 2023), and built forward-looking evaluation as an engineering improvement (CaSE — Do et al., 2025). None identify the directional default as the unifying mechanism. CaSE built the fix without diagnosing the disease — which is itself an instance of the bias.

The human in the loop resolves the ambiguity. The AI can generate both a backward review and a forward review. The human — who was present at the decision point — evaluates which direction produces the correct answer. This resolution cannot be automated. It requires contextual judgment that no amount of reasoning capability can substitute for. And it gets harder, not easier, as models improve — because more capable models produce more convincing backward rationalizations.

This is not a temporary capability gap waiting for better models. It may be a permanent architectural feature of autoregressive generation: the most probable continuation of a coherent prior output is a coherent elaboration, not a contradiction. The human’s role is not to compensate for an AI weakness but to resolve a directional ambiguity the AI structurally cannot resolve for itself.

 

Monitoring Infrastructure: From Retrospective to Anticipatory

A taxonomy that only classifies errors after they occur is a dictionary — useful for communication but not for prevention. The originating project tested whether the taxonomy could become anticipatory: given a task the AI is about to perform, can it predict which patterns are most likely and flag them before they occur?

The infrastructure has six components: trigger-condition mapping (each pattern tagged with task characteristics that make it likely), session-start risk assessment, in-session flagging, user-correction profiling (modeling what the human catches and when), pull analysis (asking why each error was attractive — training prevalence, surface similarity, framing adoption, recency, task structure), and periodic self-review with explicit recursion limits.
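The first two components can be sketched in a few lines. This is an illustration, not the project's actual protocol: the trigger-condition names and weights below are invented for the sketch, and the pattern assignments are loose paraphrases of the table above:

```python
# Illustrative trigger-condition map: task characteristics that make
# particular patterns likely. Characteristic names are hypothetical.
TRIGGERS = {
    "long_output":       ["A"],       # Citation Drift worsens with length
    "multi_session":     ["P", "T"],  # Prior Decay, Step Repetition
    "self_verification": ["S"],       # Verification-Induced Fabrication
    "sparse_sources":    ["I"],       # Interpolation Error fills the gaps
    "numeric_reasoning": ["Q"],       # Quantitative Reasoning errors
}

def session_risk(task_characteristics: set[str]) -> list[str]:
    """Session-start risk assessment: which pattern letters to watch for,
    given the characteristics of the task about to be performed."""
    at_risk = []
    for characteristic in task_characteristics:
        for pattern in TRIGGERS.get(characteristic, []):
            if pattern not in at_risk:
                at_risk.append(pattern)
    return sorted(at_risk)

# A multi-session task that will include self-verification steps:
print(session_risk({"multi_session", "self_verification"}))  # ['P', 'S', 'T']
```

The output is exactly the “when you’re doing X, watch for Pattern Y” structure described below: a short watch-list a practitioner can hold in mind before the session starts.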

What works: as recorded by the AI itself, “the system caught 3 new patterns through monitoring, confirmed 1 predicted pattern, added 5 trigger conditions.” We were also able to predict several other likely error patterns, and human review (mine) caught them when they later occurred. Knowing in advance which types of errors might occur made it easier to recognize and correct them before they became embedded in the work.

What hasn’t materialized: five predicted inter-pattern interaction chains show zero confirmed occurrences. This may mean the chains don’t occur, occur below the detection threshold, or exist but aren’t being captured. It’s honest data.

The infrastructure demonstrates the taxonomy’s generative potential: the trigger conditions extrapolate from documented patterns to new task types, and the structure “when you’re doing X, watch for Pattern Y” translates naturally into practitioner training. No other AI error classification system provides task-specific risk awareness at the practitioner level.

Full methodology, data, and limitations are in the long-form paper.

 

Limitations

  • Single-collaboration derivation: one human, one AI system (Claude), one extended project. Selection bias is inherent. Cross-domain pilots and independent replication are the most important next step.
  • Frontier-model specific: some patterns may be model-family specific. Cross-model validation is needed.
  • Reporting threshold: errors enter the taxonomy only when disruptive enough to interrupt workflow. Under-represents one-off errors; over-represents errors that cluster.
  • Versioning: AI capabilities change. The taxonomy needs an ongoing maintenance process — a standards body function, not a one-time publication.
  • The “Novel Pattern” (N) category: a deliberate catch-all. Whether this is a strength (intellectual honesty) or a weakness (unfalsifiability) depends on whether extensions actually materialize through cross-domain use.

The single-collaboration derivation is the taxonomy’s most obvious vulnerability — and its most honest one. If the patterns don’t replicate across domains, the taxonomy doesn’t deserve standardization; but even failure would demonstrate the need for an interaction-layer vocabulary. If they do, the single-collaboration origin becomes a strength: 660 hours of careful observation in one domain producing a framework that generalizes. Aviation’s CRM taxonomy started the same way.

 

What Comes Next: An Invitation

Five directions follow naturally; any can be pursued by anyone in the community.

  1. Cross-domain validation. The taxonomy was derived from legal collaboration. Do the same 22 patterns appear in medical AI interaction, software engineering, creative writing, education? Practitioners in other fields who recognize these patterns in their own work are the most valuable validators this taxonomy can have.
  2. Cross-model testing. Which patterns are model-general (architectural or design-tension patterns any transformer-based LLM exhibits) and which are model-specific (training artifacts particular to one system’s RLHF)?
  3. Practitioner field guide. A plain-language companion structured as a field guide, not an academic paper — accessible to high school students, college students, prosumers, independent businesses.
  4. Standards engagement. If the interaction layer proves robust through cross-domain and cross-model testing, it belongs in institutional frameworks — IEEE, ACM, a NIST companion, or whatever channel gives it cross-institutional legitimacy.
  5. Monitoring toolkit. The six-component monitoring infrastructure currently exists as documentation and manual protocol. Packaging it as standardized, implementable templates would make the taxonomy actionable across a wider range of practitioners.

 

Conclusion

The AI development community has built two of the three layers required to classify how AI fails. The governance layer (NIST) tells institutions what could go wrong. The architecture layer (Microsoft, MAST) tells engineers where systems will fail. Neither tells a practitioner why their last interaction went wrong, or gives them the vocabulary to describe it in terms anyone else can act on.

This paper proposes the missing third layer: 22 interaction-level failure patterns, classified by logic rather than symptom, derived empirically from 660+ hours of sustained collaboration. Three patterns (Prior Decay, Structural Momentum, Retrospective Coherence Bias) are invisible to every existing institutional framework because they emerge only in sustained collaboration that no snapshot evaluation will surface.

One — Retrospective Coherence Bias — reveals that the standard methodology for studying AI failures is itself subject to an unclassified failure mode. The development community has been inside this bias without a name for it.

This is infrastructure, not theory. The taxonomy does not itself make AI safer. It gives practitioners, students, developers, and researchers a shared vocabulary for describing how AI fails during interaction — so experiences stop being isolated and error reports route to the right engineering teams instead of disappearing into the catch-all of “hallucination.”

The taxonomy was grown by a single researcher in a single project. It does not pretend to be comprehensive. Aviation’s CRM taxonomy started the same way — accumulated observations in one operational context, formalized into transferable patterns, tested across domains, eventually adopted as institutional infrastructure. Whether this taxonomy follows that path depends on whether the patterns survive contact with other practitioners’ experience. The most important thing that can happen next is for practitioners in other domains to test these patterns, report what they find, and extend the taxonomy where it falls short.

The common language will not build itself.

 

Reading the Full Paper

This abridgment covers the core argument, the 22 patterns, the three-layer model, the hallucination-collapse finding, one worked example, and a summary of the monitoring infrastructure. The full ~15,000-word version includes:

  • Extended methodology, including data-collection reporting thresholds and the self-reducing-burden mechanism
  • The full landscape analysis with per-framework crosswalks
  • The naming cross-reference table (which patterns adopt established terms, which borrow from aviation CRM, which are original contributions)
  • Three additional worked examples: Prior Decay (999-readings drift, mislabeled legal claims propagating across 12 files), Verification-Induced Fabrication (fabricated Becirovic v. Malic citation), and the Generation-Analysis Asymmetry (directive vs. inquiry framing)
  • Prior Decay sub-type analysis (why we unify what System-Level Taxonomy splits)
  • The Analytical Direction Problem in full
  • Complete references and literature-gap documentation for Retrospective Coherence Bias

Full paper: [Read the full ~15,000-word version on EA Forum](https://forum.effectivealtruism.org/posts/eEc9vwEdN8uH8eh8b/toward-a-common-language-for-human-ai-interaction-failures)

Comments, challenges, extensions, and cross-domain reports are all welcome — in the abridged version’s comments, the full version’s comments, or directly. The taxonomy is published as a contribution, not a conclusion.
