Skip to content
RoleOrganizing spine of the entire framework
LayersA (Instruments) → B (Evidence families) → C (Criteria) → D (Validity types) → E (Verdicts)
Verdict annotationMode tag: the seven description levels
Last updated16 May 2026

Every circuit claim in mechanistic interpretability is a chain from a concrete measurement to a conclusion. The taxonomy names every link in that chain. Reading bottom-up is how you build a claim. Reading top-down is how you evaluate one.

Mode tag ── [computational] [algorithmic] [representational]
[implementational] [architectural] [structural] [transportable]
Layer E ── Verdict ────────── The claim, stated with explicit scope and mode tag
Layer D ── Validity types ─── The five abstract questions a claim must answer
│ ├── Construct
│ ├── Internal
│ ├── External
│ ├── Measurement
│ └── Interpretive
Layer C ── Criteria ───────── ~27 specific, falsifiable conditions, grouped by type
Layer B ── Evidence families ─ The six kinds of signal an instrument can produce
│ Causal | Structural | Representational | Behavioral | Info-theoretic | Measurement
Layer A ── Instruments ────── The concrete runnable tests

A claim that skips Layer D is not a finding. It is a measurement with a story attached.

DirectionUse case
Bottom-up (A → E)Building a new claim: what does my instrument establish?
Top-down (E → A)Auditing an existing claim: what evidence would this verdict require?
Sideways (across Layer B)Checking convergent validity: do independent evidence families agree?
Mode tag lastAfter the verdict is assembled, check whether the declared description level is licensed

The layers form a dependency order:

  1. Layer A must be run before Layer B can be assigned.
  2. Layer C cannot be assessed before Layer B.
  3. Layer D cannot be satisfied before Layer C.
  4. Layer E cannot be written before Layer D.
  5. The mode tag is applied to Layer E last.

The dependency order is the reason the audit procedure runs construct validity first and interpretive validity last.

Every verdict carries a bracketed description-mode tag that names the level of description the claim operates at.

TagClaim typePage
[computational]What is being computed and whyA_computational.md
[algorithmic]What operation is performedB_algorithmic.md
[representational]What is encoded, where, howC_representational.md
[implementational]Which weights/components carry itD_implementational.md
[architectural]How computational labor is distributedE_architectural.md
[structural]What the weights say before any inputF_structural.md
[transportable]Which features survive cross-model shiftG_transportable.md

For how the seven modes extend Marr’s three levels, see H_marr-comparison.md.