May 7, 2026 · EVALUATION
Evals that stick: how to keep retrieval honest after launch
Most LLM evaluation suites die in the first month. The ones that survive are tied to a release decision the team actually makes.
Insights
Articles, rubrics, and operating notes from AI engagements with Philadelphia-area teams. Long-form papers live in the papers section.