What Runtime Interpretability Actually Costs, Part 1: The Case for Measuring It
Everyone assumes activation probes are too expensive to run in production. I ran the numbers on paper, and I no longer believe that. Here is the argument, my predictions, and the harness I built to settle the question.