#testing

5 posts filed under “testing”

Sep 6, 2025

#ai #ai-evaluation #evals

The AI Evals Rebuild: How to Actually Test AI Systems

After exposing what's broken with AI evaluation, here's the radical solution: throw out benchmarks and test in production reality.

Jul 12, 2025

#content-pipeline #product-management #quality-assurance

Mastering the Full Content Pipeline Test

Shipping broken content is a costly mistake. A seemingly minor glitch can lead to lost revenue, damaged brand reputation, and frustrated users.

Jul 12, 2025

#ai-integration #multi-agent-systems #system-reliability

Your Multi-AI Testing Strategy Will Fail

Traditional testing approaches catastrophically fail for multi-AI systems. I've watched teams spend months on test suites that caught zero production failures.

Jul 7, 2025

#ai-evaluation #ai-systems #evals

The Evaluation Infrastructure We Need: Why AI Testing is Fundamentally Broken

Current AI evaluation approaches are built for software, not systems that reason. Here's the infrastructure we actually need.

Jun 28, 2025

#qa #testing #velocity

Testing at Light Speed: How QA Adapts to AI Velocity

"How can we possibly test features that are built in hours?" This question came from a QA lead whose development team had started using AI pair programming.