#production-systems
2 posts filed under “production-systems”
The 10-Minute AI POC That Becomes a 10-Month Nightmare
It started with a Jupyter notebook. 'Look, I built a chatbot in 10 minutes!' Nine months later, three engineers had quit and the company almost folded.
The AI Evals Rebuild: How to Actually Test AI Systems
After exposing what's broken with AI evaluation, here's the radical solution: throw out benchmarks and test in production reality.