How the corpus works
The corpus is the evidence layer of a13i. The book makes claims; the corpus is where claims earn the right to be made.
What an experiment is
Section titled “What an experiment is”An experiment is a reproducible study of how AI systems perform against interfaces, where accessibility-relevant properties of the interface are the variable. Examples of the genre:
- The same content rendered as semantic HTML and as div soup, with agents performing identical tasks against both.
- Forms with and without programmatic labels, measured for agent completion accuracy.
- Pages with correct, missing, and abusive ARIA, measured for both screen reader output quality and agent comprehension.
- Token cost of equivalent pages at different levels of markup hygiene.
Standards
Section titled “Standards”Every experiment follows corpus/TEMPLATE.md and the lifecycle in CONTRIBUTING.md (proposed, in-progress, reported, replicated, contested). The non-negotiables:
- Pinned model IDs, dates, parameters, and full prompts.
- Published materials: the actual HTML variants, tasks, and scoring rubrics.
- Raw outputs alongside summaries.
- Threats to validity stated by the authors, not just by critics.
- Null and negative results published with the same prominence as positive ones.
Why this rigor
Section titled “Why this rigor”Two failure modes would kill this project. The first is folklore: plausible claims repeated until they harden into best practice without ever being tested. The second is marketing: vendors citing a13i to sell overlays and miracle widgets. Method discipline is the defense against both.