#testing
2 posts
The Evaluation Infrastructure We Need: Why AI Testing is Fundamentally Broken
Existing evaluation infrastructure was built for deterministic software. AI systems are probabilistic, context-dependent, and non-reproducible. The...
Testing at Light Speed: How QA Adapts to AI Velocity
AI-generated code produces different bugs than human-written code. QA built for syntax checking looks for the wrong failures.