Make reliability visible.
AI companies publish capability claims. Users live with the misses. This index makes output quality legible in public: user-reported flags, a visible denominator, the top failure reasons, and a review ladder instead of silent drift.
Zero baseline. The first public flags set the benchmark, and the denominator stays visible the whole time.