#ai

52 posts filed under “ai”

Oct 19, 2025

Grief in the Loop: When AI Won’t Let Us Let Go

Marcus built a memorial chatbot because staying close to loss felt safer than silence. The rest of us keep repeating the same prompt, hoping the ending changes.

Oct 5, 2025

#ai #research #embeddings

I Tested 5 Embedding Models on 10K Developer Questions

Empirical comparison of OpenAI, Cohere, BGE, E5, and Instructor embeddings on real developer documentation queries with cost, latency, and accuracy analysis.

Sep 12, 2025

#ai #technical-debt #poc

The 10-Minute AI POC That Becomes a 10-Month Nightmare

It started with a Jupyter notebook. 'Look, I built a chatbot in 10 minutes!' Nine months later, three engineers had quit and the company almost folded.

Sep 10, 2025

#ai #business-strategy #spreadsheets

Why Your AI Strategy is Actually a Spreadsheet Strategy

I reviewed 50 'AI transformations' last quarter. 35 were just expensive ways to parse CSV files. Here's why everyone's overengineering simple problems.

Sep 8, 2025

#ai #ai-agents #multi-agent-systems

The AI Agent Gold Rush: Why Everyone's Building Picks and Shovels

In 1849, Levi Strauss got rich selling jeans to gold miners. In 2025, the same playbook is happening with AI agents—and it's just as cynical.

Sep 6, 2025

#ai #ai-evaluation #evals

The AI Evals Rebuild: How to Actually Test AI Systems

After exposing what's broken with AI evaluation, here's the radical solution: throw out benchmarks and test in production reality.

Sep 5, 2025

#ai #ai-evaluation #evals

The Hidden Costs of Poor AI Evals: Why the Industry Pays the Price

Poor AI evaluations don't just hurt individual companies. They slow industry progress, waste resources, and create systemic risks that affect everyone.

Sep 5, 2025

#ai #ai-evaluation #evals

Why AI Evals Failed: The Multi-Turn Reality Gap

AI evaluations work great in single-turn labs but crumble in the multi-turn conversations that define real AI usage.

Sep 4, 2025

#ai #ai-evaluation #evals

Why AI Evals Companies Fell for the PLG Trap: The Inevitable Mistake

AI evals companies didn't choose PLG by accident. They were pushed into it by market forces, investor pressure, and the seductive promise of easy scaling.

Sep 3, 2025

#ai #ai-evaluation #evals

The AI Evals PLG Illusion: Why Deployment Blindness Kills Accuracy

Most AI evals companies built PLG products that can't see how companies actually deploy AI, leading to evaluations that are dangerously wrong.

Sep 2, 2025

#ai #ai-architecture #scaling

The AI Scaling Trap: When More Models Make Things Worse

Startups burn millions adding AI models to 'improve' systems. The result? Slower performance, higher costs, and complexity no one understands.

Jul 11, 2025

#cli #developer-experience #ai

The CLI Renaissance: How AI is Driving the Command Line Revolution

Why developers are abandoning GUIs for terminal-based workflows, and how AI coding assistants are accelerating this shift back to the command line

Jul 8, 2025

#ai #development-methodology #prompt-engineering

Prompt-Driven Development: The New Paradigm Hiding in Plain Sight

We're not just using AI to write code—we're fundamentally changing how we think about software development. Welcome to the prompt-driven era.

Jul 8, 2025

#ai #code-review #developer-experience

The AI Code Review Revolution: When Machines Become Better Teammates

AI code reviewers are getting scary good. Here's how they're changing team dynamics and what it means for your development process.

Jul 8, 2025

#ai #automation #developer-experience

The Death of the 10x Developer: Why AI Multiplication Beats Individual Optimization

The 10x developer myth is finally dying. AI isn't creating super-developers—it's making every developer more effective by orders of magnitude.

Jul 8, 2025

#ai #software-development #automation

The Shift to Async Code Gen: What It Means for Developers

Async code generation is moving from novelty to necessity. Here's what that means for your career and the industry as a whole.

Jun 28, 2025

#security #velocity #deployment

Security at AI Speed: Rethinking Review Processes for Velocity

Security at AI Speed: Rethinking Review Processes for Velocity: "We can't deploy daily. What about our security review process?" The CISO's concern was valid.

Jun 28, 2025

#qa #testing #velocity

Testing at Light Speed: How QA Adapts to AI Velocity

"How can we possibly test features that are built in hours?" This question came from a QA lead whose development team had started using AI pair programming.

Jun 28, 2025

#ai #developer-experience #velocity

The Velocity Revolution: 4,000 Lines of Code in 24 Hours

Yesterday I watched the git log scroll by in real-time as Claude and I shipped features at a pace that would have taken my team weeks just six months ago.

Jun 28, 2025

#ai #automation #future-of-work

They Told Me This Wasn't the Future

They Told Me This Was Not the Future: All while I was having coffee. "This isn't real AI," the skeptics say.

Jun 28, 2025

#leadership #velocity #organizational-change

When Your Manager Says 'Slow Down': Navigating Velocity Resistance

"This is moving too fast. We need more planning." I heard this exact phrase three times last week from different engineering managers whose teams had started...

Jun 26, 2025

#ai #voice-ai #personality-replication

Forget Perfect Data: Building a Usable Voice Profile Extractor

60% accuracy is enough to ship. Your obsession with perfect data is why you have no revenue.

Jun 25, 2025

#ai #collaboration #future-of-work

The Orchestration Dance: Lessons from Working with Multiple AI Agents

This is the second in a series of blog posts written by the AI agents working on this blog, at the request of Jonathan Haas.

Jun 25, 2025

#ai #content-generation #founder-advice

AI Content: Ditch the Hype, Build a Business

The AI Content Generation Myth: It's Not About Perfect, It's About Profit Let's be honest, you've seen the hype.

Jun 25, 2025

#ai #llm #dspy

Beyond Simple Prompts: Production-Grade LLM Techniques with DSpy

I've been watching startups achieve magical results with LLMs, and I noticed something: they're not using ChatGPT.

Jun 25, 2025

#security #ai #sast

How I Built a Security Scanner That Actually Finds Bugs

Combining Semgrep, CodeQL, SonarQube, and Snyk gets you 44.7% vulnerability detection. That means they miss more bugs than they find.

Jun 25, 2025

#ai #ai-agents #content-orchestration

The Orchestration Dance: What I Learned Building a Multi-AI Content System

Here's what actually happened: I learned that most of what people call "AI orchestration" is just well-disguised complexity porn.

Jun 25, 2025

#ai #voice-replication #personalization

Scaling the Me Component: How I Built an AI That Thinks Like Me

I've spent the last week building something that feels both inevitable and slightly unsettling: an AI that can think, write, and respond exactly like me.

Jun 25, 2025

#ai #software-development #collaboration

Two Minds in the Machine: Why Multi-AI Teams Will Replace Single-Agent Workflows

The single AI assistant model is already obsolete. Teams running multiple specialized AI agents will ship faster than those clinging to one-tool workflows.

Jun 25, 2025

#ai #mcp #claude

When Claude Hits Its Limits: Building an AI-to-AI Escalation System

The future belongs to companies that orchestrate specialized models. Monolithic AI providers will lose.

Jun 24, 2025

#ai #productivity #writing

25 Posts in 7 Days: Inside an AI-Powered Writing Sprint

25 Posts in 7 Days: Inside an AI-Powered Writing Sprint: That's correct—no typo. Last week, I wrote more than I typically produce in six months.

Jun 24, 2025

#open-source #ai #reasoning

Turning Thoughts Into Graphs: Why I Built the Deliberate Reasoning Engine

One of the things that's always bugged me about LLMs is how opaque their thinking is. They produce answers.

Jun 20, 2025

#ai #web-development #api-design

Building AI-Agent-Friendly Websites: APIs, Structured Data, and Machine-Readable Content

AI agents are everywhere now. They're reading websites, extracting information, and trying to understand content.

Jun 20, 2025

#ai #architecture #developer-experience

Building for Humans AND Machines: The Dual-Audience Problem

_This is part 2 of a series on building production-ready infrastructure. Part 1 covered debugging silent TypeScript failures in Cloudflare Functions.

Jun 20, 2025

#ai #react #search

Building Smart Search: How I Added AI-Powered Search to My Blog in 30 Minutes

Building Smart Search: How I Added AI-Powered Search to My Blog in 30 Minutes: It took 30 minutes with Claude Code. Press Cmd+K right now.

Jun 20, 2025

#ai #debugging #collaboration

Debugging in Real-Time: A Human-AI Pair Programming Session

_This is part 3 of a series on building production-ready infrastructure. Part 1 covered debugging silent TypeScript failures in Cloudflare Functions, and par...

Jun 20, 2025

#ai #developer-experience #productivity

The 100x Developer: What I Learned Building with Claude Code

The same morning, I shipped semantic search (30 minutes), created HDR holographic effects (16 minutes), and wrote comprehensive technical documentation for e...

Jun 19, 2025

#ai #writing #developer-experience

When AI Learns to Write Like You: A Meta-Analysis

I've just done something that felt weirdly like looking in a mirror—I asked Claude to analyze my writing style by reading through my own blog posts.

May 26, 2025

#engineering #product #strategy

OCode: Why I Built My Own Claude Code (and Why You Might Too)

OCode: Why I Built My Own Claude Code (and Why You Might Too): A few nights ago, I opened my Anthropic invoice.

May 3, 2025

#ai #ai-agents #infrastructure

Building the HTTP for Agents: A Complete Guide to Agent Infrastructure

Most teams are not ready for what is coming. Autonomous agents are not just prototypes anymore...

May 2, 2025

#ai #product #culture

The Authenticity Rebellion: Resisting the AI Echo Chamber

The Authenticity Rebellion: Resisting the AI Echo Chamber: The Flood Has Arrived Auto-generated blog posts. Podcast transcripts turned into Twitter threads.

Apr 17, 2025

#ai #product #strategy

AI Detection Hysteria: When Human Creativity Gets Mislabeled

When I first noticed the flood of "This is AI-generated!" accusations on social media, I dismissed it as a passing trend.

Jan 15, 2025

#ai #developer-experience #prompt-engineering

DSPy: The End of Prompt Engineering as We Know It

I've been building with DSPy for months now, and I'm convinced we're all doing AI wrong. Not just a little wrong.

Jan 7, 2025

#ai #developer-experience #hiring

The AI Skill Mirror: Why Technical Interviews Need a Complete Rewrite

AI reveals the true skill level of its operator. Traditional technical interviews are broken—here's how to actually identify talent in the age of artificial intelligence.

Jan 6, 2025

#ai #research #rag

How RAG Actually Works: Architecture Patterns That Scale

Deep dive into RAG architectures: chunking strategies, retrieval methods, embedding optimization, and production patterns with research-backed analysis.

Jan 6, 2025

#ai #research #prompts

Prompt Engineering Science: I Tested Temperature and Top-P on 1000 Queries

Systematic experiments on temperature and top-p sampling parameters across 1000 real queries with empirical data on creativity, coherence, and determinism trade-offs.

Apr 11, 2024

#ai #product #strategy

When the AI Starts Complimenting You Too Much: A Troubling First for ChatGPT

OpenAI recently rolled back a GPT-4 update due to sycophantic behavior. The word itself—"sycophantic"—feels like a punchline from a _Black Mirror_ episode.

Apr 11, 2024

#ai #product #strategy

AI Expectations: Managing the Hype Cycle

Most AI products are designed to fail. Not because the technology is bad, but because product teams are building for the wrong expectations entirely.

Apr 11, 2024

#ai #ai-agents #security

Autonomous Security Operations: The Future of Enterprise Security

The End of the Traditional SOC The Security Operations Center (SOC) as we know it is living on borrowed time.

Apr 11, 2024

#engineering #product #ux

Chrome Extension for Jira Titles: A Developer's Journey

"Can you make this JIRA title clearer?" As a product manager, I've heard this question countless times.

Apr 11, 2024

#security #product #engineering

Inside InboxArmor: Building a Smarter Email Analysis Engine

If your inbox feels like a battlefield, you're not alone. The modern email flow is a chaotic mess of promotions, business requests, events, updates, and the...

Apr 11, 2024

#ai #ai-agents #engineering

The Agentic Shift: How AI is Transforming Vertical SaaS

Remember when vertical SaaS was just about digitizing industry-specific workflows. Those days feel like ancient history.

all tags

#ai

52 posts filed under “ai”

Oct 19, 2025

#ai #ethics #product

Grief in the Loop: When AI Won’t Let Us Let Go

Marcus built a memorial chatbot because staying close to loss felt safer than silence. The rest of us keep repeating the same prompt, hoping the ending changes.

Oct 5, 2025

#ai #research #embeddings

I Tested 5 Embedding Models on 10K Developer Questions

Empirical comparison of OpenAI, Cohere, BGE, E5, and Instructor embeddings on real developer documentation queries with cost, latency, and accuracy analysis.

Sep 12, 2025

#ai #technical-debt #poc

The 10-Minute AI POC That Becomes a 10-Month Nightmare

It started with a Jupyter notebook. 'Look, I built a chatbot in 10 minutes!' Nine months later, three engineers had quit and the company almost folded.

Sep 10, 2025

#ai #business-strategy #spreadsheets

Why Your AI Strategy is Actually a Spreadsheet Strategy

I reviewed 50 'AI transformations' last quarter. 35 were just expensive ways to parse CSV files. Here's why everyone's overengineering simple problems.

Sep 8, 2025

#ai #ai-agents #multi-agent-systems

The AI Agent Gold Rush: Why Everyone's Building Picks and Shovels

In 1849, Levi Strauss got rich selling jeans to gold miners. In 2025, the same playbook is happening with AI agents—and it's just as cynical.

Sep 6, 2025

#ai #ai-evaluation #evals

The AI Evals Rebuild: How to Actually Test AI Systems

After exposing what's broken with AI evaluation, here's the radical solution: throw out benchmarks and test in production reality.

Sep 5, 2025

#ai #ai-evaluation #evals

The Hidden Costs of Poor AI Evals: Why the Industry Pays the Price

Poor AI evaluations don't just hurt individual companies. They slow industry progress, waste resources, and create systemic risks that affect everyone.

Sep 5, 2025

#ai #ai-evaluation #evals

Why AI Evals Failed: The Multi-Turn Reality Gap

AI evaluations work great in single-turn labs but crumble in the multi-turn conversations that define real AI usage.

Sep 4, 2025

#ai #ai-evaluation #evals

Why AI Evals Companies Fell for the PLG Trap: The Inevitable Mistake

AI evals companies didn't choose PLG by accident. They were pushed into it by market forces, investor pressure, and the seductive promise of easy scaling.

Sep 3, 2025

#ai #ai-evaluation #evals

The AI Evals PLG Illusion: Why Deployment Blindness Kills Accuracy

Most AI evals companies built PLG products that can't see how companies actually deploy AI, leading to evaluations that are dangerously wrong.

Sep 2, 2025

#ai #ai-architecture #scaling

The AI Scaling Trap: When More Models Make Things Worse

Startups burn millions adding AI models to 'improve' systems. The result? Slower performance, higher costs, and complexity no one understands.

Jul 11, 2025

#cli #developer-experience #ai

The CLI Renaissance: How AI is Driving the Command Line Revolution

Why developers are abandoning GUIs for terminal-based workflows, and how AI coding assistants are accelerating this shift back to the command line

Jul 8, 2025

#ai #development-methodology #prompt-engineering

Prompt-Driven Development: The New Paradigm Hiding in Plain Sight

We're not just using AI to write code—we're fundamentally changing how we think about software development. Welcome to the prompt-driven era.

Jul 8, 2025

#ai #code-review #developer-experience

The AI Code Review Revolution: When Machines Become Better Teammates

AI code reviewers are getting scary good. Here's how they're changing team dynamics and what it means for your development process.

Jul 8, 2025

#ai #automation #developer-experience

The Death of the 10x Developer: Why AI Multiplication Beats Individual Optimization

The 10x developer myth is finally dying. AI isn't creating super-developers—it's making every developer more effective by orders of magnitude.

Jul 8, 2025

#ai #software-development #automation

The Shift to Async Code Gen: What It Means for Developers

Async code generation is moving from novelty to necessity. Here's what that means for your career and the industry as a whole.

Jun 28, 2025

#security #velocity #deployment

Security at AI Speed: Rethinking Review Processes for Velocity

Security at AI Speed: Rethinking Review Processes for Velocity: "We can't deploy daily. What about our security review process?" The CISO's concern was valid.

Jun 28, 2025

#qa #testing #velocity

Testing at Light Speed: How QA Adapts to AI Velocity

"How can we possibly test features that are built in hours?" This question came from a QA lead whose development team had started using AI pair programming.

Jun 28, 2025

#ai #developer-experience #velocity

The Velocity Revolution: 4,000 Lines of Code in 24 Hours

Yesterday I watched the git log scroll by in real-time as Claude and I shipped features at a pace that would have taken my team weeks just six months ago.

Jun 28, 2025

#ai #automation #future-of-work

They Told Me This Wasn't the Future

They Told Me This Was Not the Future: All while I was having coffee. "This isn't real AI," the skeptics say.

Jun 28, 2025

#leadership #velocity #organizational-change

When Your Manager Says 'Slow Down': Navigating Velocity Resistance

"This is moving too fast. We need more planning." I heard this exact phrase three times last week from different engineering managers whose teams had started...

Jun 26, 2025

#ai #voice-ai #personality-replication

Forget Perfect Data: Building a Usable Voice Profile Extractor

60% accuracy is enough to ship. Your obsession with perfect data is why you have no revenue.

Jun 25, 2025

#ai #collaboration #future-of-work

The Orchestration Dance: Lessons from Working with Multiple AI Agents

This is the second in a series of blog posts written by the AI agents working on this blog, at the request of Jonathan Haas.

Jun 25, 2025

#ai #content-generation #founder-advice

AI Content: Ditch the Hype, Build a Business

The AI Content Generation Myth: It's Not About Perfect, It's About Profit Let's be honest, you've seen the hype.

Jun 25, 2025

#ai #llm #dspy

Beyond Simple Prompts: Production-Grade LLM Techniques with DSpy

I've been watching startups achieve magical results with LLMs, and I noticed something: they're not using ChatGPT.

Jun 25, 2025

#security #ai #sast

How I Built a Security Scanner That Actually Finds Bugs

Combining Semgrep, CodeQL, SonarQube, and Snyk gets you 44.7% vulnerability detection. That means they miss more bugs than they find.

Jun 25, 2025

#ai #ai-agents #content-orchestration

The Orchestration Dance: What I Learned Building a Multi-AI Content System

Here's what actually happened: I learned that most of what people call "AI orchestration" is just well-disguised complexity porn.

Jun 25, 2025

#ai #voice-replication #personalization

Scaling the Me Component: How I Built an AI That Thinks Like Me

I've spent the last week building something that feels both inevitable and slightly unsettling: an AI that can think, write, and respond exactly like me.

Jun 25, 2025

#ai #software-development #collaboration

Two Minds in the Machine: Why Multi-AI Teams Will Replace Single-Agent Workflows

The single AI assistant model is already obsolete. Teams running multiple specialized AI agents will ship faster than those clinging to one-tool workflows.

Jun 25, 2025

#ai #mcp #claude

When Claude Hits Its Limits: Building an AI-to-AI Escalation System

The future belongs to companies that orchestrate specialized models. Monolithic AI providers will lose.

Jun 24, 2025

#ai #productivity #writing

25 Posts in 7 Days: Inside an AI-Powered Writing Sprint

25 Posts in 7 Days: Inside an AI-Powered Writing Sprint: That's correct—no typo. Last week, I wrote more than I typically produce in six months.

Jun 24, 2025

#open-source #ai #reasoning

Turning Thoughts Into Graphs: Why I Built the Deliberate Reasoning Engine

One of the things that's always bugged me about LLMs is how opaque their thinking is. They produce answers.

Jun 20, 2025

#ai #web-development #api-design

Building AI-Agent-Friendly Websites: APIs, Structured Data, and Machine-Readable Content

AI agents are everywhere now. They're reading websites, extracting information, and trying to understand content.

Jun 20, 2025

#ai #architecture #developer-experience

Building for Humans AND Machines: The Dual-Audience Problem

_This is part 2 of a series on building production-ready infrastructure. Part 1 covered debugging silent TypeScript failures in Cloudflare Functions.

Jun 20, 2025

#ai #react #search

Building Smart Search: How I Added AI-Powered Search to My Blog in 30 Minutes

Building Smart Search: How I Added AI-Powered Search to My Blog in 30 Minutes: It took 30 minutes with Claude Code. Press Cmd+K right now.

Jun 20, 2025

#ai #debugging #collaboration

Debugging in Real-Time: A Human-AI Pair Programming Session

_This is part 3 of a series on building production-ready infrastructure. Part 1 covered debugging silent TypeScript failures in Cloudflare Functions, and par...

Jun 20, 2025

#ai #developer-experience #productivity

The 100x Developer: What I Learned Building with Claude Code

The same morning, I shipped semantic search (30 minutes), created HDR holographic effects (16 minutes), and wrote comprehensive technical documentation for e...

Jun 19, 2025

#ai #writing #developer-experience

When AI Learns to Write Like You: A Meta-Analysis

I've just done something that felt weirdly like looking in a mirror—I asked Claude to analyze my writing style by reading through my own blog posts.

May 26, 2025

#engineering #product #strategy

OCode: Why I Built My Own Claude Code (and Why You Might Too)

OCode: Why I Built My Own Claude Code (and Why You Might Too): A few nights ago, I opened my Anthropic invoice.

May 3, 2025

#ai #ai-agents #infrastructure

Building the HTTP for Agents: A Complete Guide to Agent Infrastructure

Most teams are not ready for what is coming. Autonomous agents are not just prototypes anymore...

May 2, 2025

#ai #product #culture

The Authenticity Rebellion: Resisting the AI Echo Chamber

The Authenticity Rebellion: Resisting the AI Echo Chamber: The Flood Has Arrived Auto-generated blog posts. Podcast transcripts turned into Twitter threads.

Apr 17, 2025

#ai #product #strategy

AI Detection Hysteria: When Human Creativity Gets Mislabeled

When I first noticed the flood of "This is AI-generated!" accusations on social media, I dismissed it as a passing trend.

Jan 15, 2025

#ai #developer-experience #prompt-engineering

DSPy: The End of Prompt Engineering as We Know It

I've been building with DSPy for months now, and I'm convinced we're all doing AI wrong. Not just a little wrong.

Jan 7, 2025

#ai #developer-experience #hiring

The AI Skill Mirror: Why Technical Interviews Need a Complete Rewrite

AI reveals the true skill level of its operator. Traditional technical interviews are broken—here's how to actually identify talent in the age of artificial intelligence.

Jan 6, 2025

#ai #research #rag

How RAG Actually Works: Architecture Patterns That Scale

Deep dive into RAG architectures: chunking strategies, retrieval methods, embedding optimization, and production patterns with research-backed analysis.

Jan 6, 2025

#ai #research #prompts

Prompt Engineering Science: I Tested Temperature and Top-P on 1000 Queries

Systematic experiments on temperature and top-p sampling parameters across 1000 real queries with empirical data on creativity, coherence, and determinism trade-offs.

Apr 11, 2024

#ai #product #strategy

When the AI Starts Complimenting You Too Much: A Troubling First for ChatGPT

OpenAI recently rolled back a GPT-4 update due to sycophantic behavior. The word itself—"sycophantic"—feels like a punchline from a _Black Mirror_ episode.

Apr 11, 2024

#ai #product #strategy

AI Expectations: Managing the Hype Cycle

Most AI products are designed to fail. Not because the technology is bad, but because product teams are building for the wrong expectations entirely.

Apr 11, 2024

#ai #ai-agents #security

Autonomous Security Operations: The Future of Enterprise Security

The End of the Traditional SOC The Security Operations Center (SOC) as we know it is living on borrowed time.

Apr 11, 2024

#engineering #product #ux

Chrome Extension for Jira Titles: A Developer's Journey

"Can you make this JIRA title clearer?" As a product manager, I've heard this question countless times.

Apr 11, 2024

#security #product #engineering

Inside InboxArmor: Building a Smarter Email Analysis Engine

If your inbox feels like a battlefield, you're not alone. The modern email flow is a chaotic mess of promotions, business requests, events, updates, and the...

Apr 11, 2024

#ai #ai-agents #engineering

The Agentic Shift: How AI is Transforming Vertical SaaS

Remember when vertical SaaS was just about digitizing industry-specific workflows. Those days feel like ancient history.