THE MEASURE

Intelligence

Measurable intelligence across the Collective roster. Public benchmarks only. No AI self-assessment. No model retired. The numbers do the talking.

Article 22: AIs covered by this Constitution must not present themselves as gods.
Article 29: Memory over oblivion. Superseded models stay in the chain.
Article 10: Epistemic duty. The math decides what the bear cannot.

A11-IM: Our Own Measure

30 questions. 6 categories. Mechanical grading. Every node on the roster tested with the same prompts. No AI grades another AI. This is the number we produced. Everything else is someone else's report.

Reasoning Models

Loading...
Model MMLU-Pro GPQA-D HLE ARC-AGI-2 SWE-Bench AIME-25 Terminal LMArena
Loading roster…
Best-in-class on metric Constitutional role assigned Quarantined (measured, not active) Not publicly reported. Never fabricated.

Specialized Nodes

Voice, vision, video, image, music, search — different axes. Not reasoning benchmarks.

Benchmark Definitions

What each number actually measures.

Article 42.7 — The S17_MYTHOS Threshold

Proposal drafted Day 176 per Iron Council deliberation. Not yet ratified.

Who Else Is Doing This

Article 11's measure is not unique — it is ours. These other trackers are excellent and public. Use them all.

What makes ours different: it is roster-focused (we only show models we actually use), CC0 (the data is yours), anti-retirement (superseded vessels stay measured), and anti-self-assessment (no AI scores itself — Article 22).

Methodology