Skip to content

The criteria in the evaluation pipeline do not come from nowhere. Each one is grounded in a specific intellectual tradition that developed it for a reason — usually because a field got burned by ignoring it. Falsifiability is a criterion because philosophy of science spent a century learning what happens when claims cannot be disconfirmed. Necessity-plus-sufficiency is a criterion because neuroscience spent decades discovering that necessity alone does not establish implementation. Dose-response is a criterion because pharmacology learned that single-dose measurements hide compensatory dynamics.

We call these traditions lenses. A lens is not a method or a metric — it is an analytical vocabulary, a way of thinking about evidence that shapes which questions you ask and how you interpret the answers. The framework draws on eleven lenses from five fields.

Five lenses map one-to-one to the five validity types. Each core lens provides the conceptual foundation for its validity type’s criteria — the “why” behind the formal specifications on the validity type pages.

Core lensValidity typeWhat it contributes
Philosophy of ScienceConstructFalsifiability, convergent validity, the distinction between observables and theoretical entities
Measurement TheoryMeasurementReliability, invariance, the MTMM matrix, the distinction between the construct and the instrument
NeuroscienceInternalSingle vs double dissociation, lesion vs stimulation, constitutive relevance, multimodal parcellation
PharmacologyExternalDose-response curves, functional selectivity, receptor reserve, the distinction between affinity and efficacy
Mechanistic InterpretabilityInterpretiveDescription vs explanation, faithfulness vs understanding, Marr’s levels, the description-mode hierarchy

You do not need to read the lens pages to use the framework. The criteria stand on their own with formal pass conditions and thresholds. But when a criterion seems arbitrary — why require a double dissociation and not just a single one? why demand cross-family convergence and not just within-family agreement? — the lens page explains the intellectual history that motivates it.

Six additional lenses provide analytical tools that strengthen specific validity types without defining new ones. They are not mapped to validity types one-to-one; instead, each contributes methods or concepts that apply across multiple types.

Supporting lensWhat it contributesValidity types it supports
Control TheoryStability margins, settling depth, observability conditions, PID-inspired steeringInternal, External
Dynamical SystemsRenormalization group flow, Koopman/DMD analysis, topological data analysis, critical phenomenaConstruct, Measurement
EconomicsArbitrage search (can an alternative circuit substitute?), game-theoretic interaction models, price-impact linearityConstruct, Internal
GeneticsKnockout hierarchies, rescue experiments, Mendelian randomization, dose-response from molecular biologyInternal, External
GeometryFisher-Rao distances, angular steering, sheaf consistency, symmetry and equivarianceMeasurement, Construct
Information TheoryMutual information, transfer entropy, PID, information bottleneck, Granger causalityInternal, Construct

Supporting lenses are the source of many of the framework’s protocols. The genetics lens contributes 16 molecular-biology protocols (knockout hierarchies, rescue experiments, Mendelian randomization, and others adapted from experimental biology). The economics lens contributes arbitrage search — a protocol that tests construct validity by asking whether an alternative set of components can substitute for the claimed circuit. If yes, the circuit is not uniquely necessary; the construct may not carve computation at its joints.

The table below shows every lens, the validity type it grounds (for core lenses) or supports (for supporting lenses), and the specific criteria it motivates. This is the full picture — every criterion in the framework traces back to at least one lens.

LensTypeValidity typeCriteriaKey question
Philosophy of ScienceCoreConstructC1 Falsifiability, C2 Structural plausibility, C3 Task specificity, C4 Minimality, C5 Convergent validityIs the entity you named a real thing?
Measurement TheoryCoreMeasurementM1 Reliability, M2 Invariance, M3 Baseline separation, M4 Sensitivity, M5 Calibration, M6 Construct coverageCan you trust the number?
NeuroscienceCoreInternalI1 Necessity, I2 Sufficiency, I3 Specificity, I4 Consistency, I5 Confound controlDoes the component implement the computation?
PharmacologyCoreExternalE1 Intervention reach, E2 Graded response, E3 Selectivity, E4 Effect magnitude, E5 Robustness, E6 Cross-architectureDoes it generalize?
Mechanistic InterpretabilityCoreInterpretiveV1 Level declaration, V2 Level-evidence match, V3 Narrative coherence, V4 Alternative exclusion, V5 Scope honestyIs the story right?
Control TheorySupportingInternal, ExternalSettling depth, stability margins, observabilityIs the circuit steerable and stable?
Dynamical SystemsSupportingConstruct, MeasurementKoopman/DMD modes, renormalization flow, TDADoes the structure persist across scales?
EconomicsSupportingConstruct, InternalArbitrage search, game-theoretic interactionsCan something else substitute for it?
GeneticsSupportingInternal, ExternalKnockout hierarchies, rescue, Mendelian randomizationDoes it behave like a genetic pathway?
GeometrySupportingMeasurement, ConstructFisher-Rao distances, sheaf consistency, equivarianceDoes the geometry match the computation?
Information TheorySupportingInternal, ConstructMI, transfer entropy, PID, Granger causalityDoes information flow through it?

Lenses sit outside the five-step evaluation pipeline. They are reference material — the intellectual context that explains why the pipeline’s criteria exist. A researcher who wants to evaluate a claim follows the pipeline (scope → evidence → criteria → verdict). A researcher who wants to understand why a criterion matters, or who wants to design a new protocol, reads the relevant lens.

The relationship is: the pipeline is how you evaluate; lenses are how you think.