#ai-systems
1 post filed under “ai-systems”
The Evaluation Infrastructure We Need: Why AI Testing is Fundamentally Broken
Current AI evaluation approaches are built for software, not systems that reason. Here's the infrastructure we actually need.