all tags

#production-systems

2 posts filed under “production-systems

The 10-Minute AI POC That Becomes a 10-Month Nightmare

It started with a Jupyter notebook. 'Look, I built a chatbot in 10 minutes!' Nine months later, three engineers had quit and the company almost folded.

The AI Evals Rebuild: How to Actually Test AI Systems

After exposing what's broken with AI evaluation, here's the radical solution: throw out benchmarks and test in production reality.