A verdict is a structured statement with five required fields. A verdict that substitutes a scalar for a structured statement has discarded the information the framework is designed to preserve.

[component] [behavior/computation] [in model, on task] [verdict-strength] [mode tag]

Example:

L8.MLP causally mediates subject-verb number agreement in GPT-2 Small on the Linzen SVA dataset — Mechanistically supported [implementational]

| Field | Value |
|---|---|
| Component | L8.MLP |
| Behavior/computation | causally mediates subject-verb number agreement |
| Scope (model, task) | GPT-2 Small, Linzen SVA dataset |
| Verdict-strength | Mechanistically supported |
| Mode tag | [implementational] |

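
One way to keep a verdict structured rather than scalar is to store the fields in a small record and render the sentence from it. The following is a minimal sketch, not part of the framework itself: the class name `Verdict`, its attribute names, and the choice to split the scope field into `model` and `task` are all assumptions made for illustration.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class Verdict:
    """Hypothetical container for the five required fields.

    The scope field is stored as two strings (model, task); that split is an
    assumption of this sketch.
    """
    component: str
    behavior: str
    model: str
    task: str
    strength: str
    mode_tag: str

    def __post_init__(self) -> None:
        # Every field is required: reject missing, empty, or non-string values.
        for f in fields(self):
            value = getattr(self, f.name)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"verdict field {f.name!r} is required")

    def render(self) -> str:
        # "\u2014" is the dash used in the example verdict above.
        return (
            f"{self.component} {self.behavior} in {self.model} on {self.task}"
            f" \u2014 {self.strength} [{self.mode_tag}]"
        )


example = Verdict(
    component="L8.MLP",
    behavior="causally mediates subject-verb number agreement",
    model="GPT-2 Small",
    task="the Linzen SVA dataset",
    strength="Mechanistically supported",
    mode_tag="implementational",
)
print(example.render())
```

Because the record refuses to be constructed with an empty field, a bare score or an unscoped claim cannot be stored as a verdict in the first place.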

The verdict-strength field takes one of the following tiers:

| Tier | Meaning | Minimum evidence |
|---|---|---|
| Proposed | Hypothesis-generating; no intervention evidence yet | Structural or representational evidence only |
| Causally suggestive | Partial internal validity; necessity shown, sufficiency not established | Ablation degrades behavior; no restoration test |
| Mechanistically supported | Necessity and sufficiency established; specificity and consistency partially addressed | Ablation + patching; ≥2 ablation variants |
| Triangulated | Internal validity fully addressed; ≥1 external and ≥1 construct criterion met | All five internal criteria + cross-task or cross-scale |
| Validated | All five validity types addressed with explicit baselines and convergent evidence | Full audit passed |
| Underdetermined | Evidence consistent with multiple mechanisms; construct validity cannot resolve | Active evidence present; interpretation ambiguous |
| Disconfirmed | Fails decisively on a key validity criterion | Explicit falsification criterion met |
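
The minimum-evidence column can be read as a decision ladder. The sketch below encodes one such reading in Python. The `Evidence` record and its field names are hypothetical, and several choices are assumptions rather than rules stated by the framework: that the graded tiers are cumulative, that falsification takes precedence over everything else, and that "active evidence" means at least one ablation result.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """Hypothetical summary of what an analysis has shown (names are illustrative)."""
    ablation_variants: int = 0         # distinct ablation variants that degrade the behavior
    patching_restores: bool = False    # a restoration (patching) test succeeded
    internal_criteria_met: int = 0     # how many of the five internal criteria are met
    cross_task_or_scale: bool = False  # replicated across tasks or across model scales
    full_audit_passed: bool = False    # all five validity types audited
    ambiguous: bool = False            # evidence fits more than one mechanism
    falsified: bool = False            # an explicit falsification criterion was met


def assign_tier(e: Evidence) -> str:
    """Return the strongest tier whose minimum evidence is met (a sketch)."""
    if e.falsified:
        return "Disconfirmed"
    if e.ablation_variants == 0:
        # Assumption: without any ablation result there is no intervention
        # evidence in the table's sense, so the claim stays at Proposed.
        return "Proposed"
    if e.ambiguous:
        return "Underdetermined"
    if not (e.patching_restores and e.ablation_variants >= 2):
        return "Causally suggestive"
    if not (e.internal_criteria_met == 5 and e.cross_task_or_scale):
        return "Mechanistically supported"
    if not e.full_audit_passed:
        return "Triangulated"
    return "Validated"


print(assign_tier(Evidence(ablation_variants=2, patching_restores=True)))
```

Note that a single ablation variant plus a successful patching test still lands on "Causally suggestive" here, because the table asks for at least two ablation variants before the next tier.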

After the verdict-strength tier is assigned, add the mode tag:

| If the evidence establishes… | Licensed tag |
|---|---|
| The locus of an effect (which component, which layer) | [implementational] |
| The operation performed (what the component computes) | [algorithmic] |
| The encoding geometry (what the activations represent) | [representational] |
| The weight-space structure (without forward pass) | [structural] |
| The distribution of labor across components | [architectural] |
| Cross-model survival of the mechanism | [transportable] |
| The computational problem and why it is solved | [computational] |
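
The tag table is a plain lookup, which could be written down as follows. The short keys (`"locus"`, `"operation"`, and so on) are shorthand invented for this sketch and are not vocabulary defined by the framework.

```python
# Keys are illustrative shorthand for the left-hand column of the table above.
LICENSED_TAG = {
    "locus": "implementational",
    "operation": "algorithmic",
    "encoding geometry": "representational",
    "weight-space structure": "structural",
    "distribution of labor": "architectural",
    "cross-model survival": "transportable",
    "computational problem": "computational",
}


def licensed_tag(established: str) -> str:
    """Return the bracketed mode tag licensed by what the evidence establishes."""
    try:
        return f"[{LICENSED_TAG[established]}]"
    except KeyError:
        # Anything outside the table licenses no tag at all.
        raise ValueError(f"no mode tag is licensed by {established!r}") from None


print(licensed_tag("locus"))
```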

A partial verdict names the validity types that have been addressed and explicitly marks the others as open:

L8.MLP causally mediates SVA in GPT-2 Small — Mechanistically supported [implementational]; external validity and interpretive validity unaddressed.

The “unaddressed” clause is not optional. It prevents silent upgrading by selective citation, where a later reference quotes the tier and drops the validity types that remain open.
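
A small helper can make the clause hard to forget by computing it from the set of validity types that were actually addressed. This is a sketch under assumptions: the function name `with_open_clause` is hypothetical, and the caller supplies the full list of validity types, since the helper does not hard-code the framework's five.

```python
from typing import Iterable, Sequence


def with_open_clause(verdict: str, all_types: Sequence[str], addressed: Iterable[str]) -> str:
    """Append an explicit "unaddressed" clause listing every open validity type."""
    addressed_set = set(addressed)
    unknown = addressed_set - set(all_types)
    if unknown:
        raise ValueError(f"unknown validity types: {sorted(unknown)}")

    # Keep the caller's ordering so the clause reads consistently across verdicts.
    open_names = [f"{t} validity" for t in all_types if t not in addressed_set]
    if not open_names:
        return f"{verdict}."
    if len(open_names) == 1:
        joined = open_names[0]
    else:
        joined = ", ".join(open_names[:-1]) + " and " + open_names[-1]
    return f"{verdict}; {joined} unaddressed."


# Illustrative call: the three type names passed here are only an example list.
print(
    with_open_clause(
        "L8.MLP causally mediates SVA in GPT-2 Small \u2014 Mechanistically supported [implementational]",
        all_types=["internal", "external", "interpretive"],
        addressed=["internal"],
    )
)
```

Deriving the clause from the complement of the addressed set means an open validity type can only disappear from the sentence when someone records it as addressed.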

A verdict is not:

  • A score. “IIA = 0.48” is not a verdict.
  • A confidence. “We are confident that L8.MLP is a key node” is not a verdict.
  • A universal claim. A verdict is always scoped to a model and a task.
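
A rough lint can catch the first two cases by checking that a statement carries a recognized tier and mode tag in the layout of the example verdict. The sketch below is an assumption-laden convenience, not a definition: the function name `looks_like_verdict` is hypothetical, and a pattern match cannot tell whether a statement is properly scoped to a model and a task, so the third case still needs a human reader.

```python
import re

TIERS = (
    "Proposed", "Causally suggestive", "Mechanistically supported",
    "Triangulated", "Validated", "Underdetermined", "Disconfirmed",
)
TAGS = (
    "implementational", "algorithmic", "representational", "structural",
    "architectural", "transportable", "computational",
)

# "\u2014" is the dash that separates the claim from the tier in the example verdict.
_TAIL = re.compile(
    r"\S.* \u2014 (?:"
    + "|".join(re.escape(t) for t in TIERS)
    + r") \[(?:"
    + "|".join(TAGS)
    + r")\]"
)


def looks_like_verdict(text: str) -> bool:
    """True if the text contains a claim followed by a known tier and mode tag."""
    return _TAIL.search(text) is not None


print(looks_like_verdict("IIA = 0.48"))
print(looks_like_verdict("We are confident that L8.MLP is a key node"))
```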