Wrote a blog post dismissing A/B tests and claiming evals are the future for AI product optimization.
How media typically covers Ankur
Referenced in coverage
A/B testing and production monitoring (online evals) are more valuable than pre-production evals for optimizing AI products, especially as AI agents and models change rapidly.