Skip to content

How the corpus works

The corpus is the evidence layer of a13i. The book makes claims; the corpus is where claims earn the right to be made.

An experiment is a reproducible study of how AI systems perform against interfaces, where accessibility-relevant properties of the interface are the variable. Examples of the genre:

  • The same content rendered as semantic HTML and as div soup, with agents performing identical tasks against both.
  • Forms with and without programmatic labels, measured for agent completion accuracy.
  • Pages with correct, missing, and abusive ARIA, measured for both screen reader output quality and agent comprehension.
  • Token cost of equivalent pages at different levels of markup hygiene.

Every experiment follows corpus/TEMPLATE.md and the lifecycle in CONTRIBUTING.md (proposed, in-progress, reported, replicated, contested). The non-negotiables:

  • Pinned model IDs, dates, parameters, and full prompts.
  • Published materials: the actual HTML variants, tasks, and scoring rubrics.
  • Raw outputs alongside summaries.
  • Threats to validity stated by the authors, not just by critics.
  • Null and negative results published with the same prominence as positive ones.

Two failure modes would kill this project. The first is folklore: plausible claims repeated until they harden into best practice without ever being tested. The second is marketing: vendors citing a13i to sell overlays and miracle widgets. Method discipline is the defense against both.