LatentEvals
Services
Beliefs
Contact
Beliefs
What we believe about the state of AI evaluation.
The Evaluation Gap
The evaluation problem is now harder than the modeling problem. Why public benchmarks decay, models outrun their tests, and the field needs a different approach.