Skip to content
Validity typeInterpretive
Pass conditionA specific description-mode tag is stated explicitly in the verdict
Evidence familyN/A (criterion is about claim structure)
Minimum reportingThe mode tag, stated verbatim, in the verdict section of any published claim
Common failure modePublishing a result without a declared mode tag; leaving the claim’s scope implicit

Level declaration is the simplest interpretive criterion: before any claim can be evaluated for interpretive validity, it must declare what level of description it is making. Without a declared level, there is no standard against which to measure evidence–claim fit.

The five description mode tags (from ../00_taxonomy/index.md):

TagMeaningMinimum validity requirements
[functional]Describes input-output behavior without mechanismNone beyond behavioral evidence
[representational]Claims a variable is encoded at a componentBaseline-separated IIA or equivalent
[causal-mechanistic]Claims a component causally implements a computationNecessity + sufficiency established
[structural-mechanistic]Claims a component’s weights implement a computationStructural plausibility + causal support
[transportable]Claims the mechanism generalizes across contextsAt least one robustness result

A verdict without one of these tags is not a verdict — it is a measurement with a story attached.

The most common interpretive failure in MI is implicit level inflation: a paper establishes [representational] evidence (high IIA) and implicitly claims [causal-mechanistic] status without the additional evidence that requires. Level declaration forces the implicit claim to be explicit, where it can be evaluated.

Every verdict must contain one of the five tags verbatim, followed immediately by the scope restriction:

Verdict: [causal-mechanistic] for L8.MLP as a primary SVA locus in GPT-2 Small on the Linzen et al. prompt distribution. Not yet [transportable] — cross-architecture generalization has not been established.”

Level declaration (V1) is the prerequisite for level–evidence match (V2). You cannot check whether the evidence licenses the claimed level until the claimed level is stated.