#ai-systems

1 post filed under “ai-systems”

The Evaluation Infrastructure We Need: Why AI Testing is Fundamentally Broken

Current AI evaluation approaches are built for software, not for systems that reason. Here's the infrastructure we actually need.