#llm

4 posts filed under “llm”

Jul 12, 2025

How to create custom evaluations, model-graded assessments, and domain-specific benchmarks that actually predict real-world performance

Jul 12, 2025

Last week, I shared how I built Fission, a high-performance sandbox for executing LLM-generated code using Firecracker microVMs.

Jun 25, 2025

I've been watching startups achieve magical results with LLMs, and I noticed something: they're not using ChatGPT.

Jun 25, 2025

Every time an LLM generates code, you face a choice: trust it blindly or spend hours reviewing it. Neither option scales.