Writing

Notes on verifying what AI systems do.

Essays and open data from the lab: where AI systems behave, where they fail, and how to prove the difference. Every piece links to something you can check or run yourself.

Also published on GitHub LinkedIn