Evidence-first analysis of AI infrastructure, policy, and practice, alongside the software that supports it.
Research & Analysis
Long-form research briefs and continuously updated data products measuring how AI is built, governed, and adopted.
AI Reports
Research briefs on artificial intelligence policy, infrastructure, and industry dynamics: evidence tables, entity spotlights, and explicit assessments.
Interactive census: State of the Agent
Analyses of large-scale Claude Code agent behavior, with the full census running interactively in the browser.
AI Delegation Curve ↗
A composite measure of consequential decision-making by AI across 9 domains, with versioned, inspectable methodology.
AI Software
Open-source software for specifying, evaluating, and operating autonomous coding agents.
Threadmark ↗
A context-continuity sidecar for Codex and Claude Code that watches work, checkpoints at meaningful boundaries and on harness hook events, and hands off compact briefings to the next agent entering the workspace, all without resuming session history.
agent-evals
Overlap analysis, boundary testing, and metacognitive scoring for LLM coding agents.
Simplex
Specification for autonomous agent software construction: what to build and how to verify it is done, backed by a linter and an interactive planner.
Writing Rules
Editorial rulebook for cleaning up AI-drafted prose: cadence antipatterns, fixes, and a downloadable style guide.