#ai

28 posts

Mar 26, 2026
The Real Work of Orchestrating AI Coding Agents
Three concurrent coding agents taught me the actual bottleneck: not prompting, but assignment, evidence, review, and release control.
Mar 24, 2026
Building Kestrel: A Context-Aware AI Desktop Assistant in One Session
How I built a full LittleBird clone with screen context reading, meeting recording, arena mode, and MCP tool support — from scratch to packaged .app in a single coding session.
Sep 12, 2025
The 10-Minute AI POC That Becomes a 10-Month Nightmare
Five lines of Python and an API key produce a working demo. The gap between that demo and a production system contains failure modes the prototype...
Sep 10, 2025
Why Your AI Strategy is Actually a Spreadsheet Strategy
Most enterprise AI transformations are solving problems that spreadsheets handle at 1/50th the cost. The misalignment is driven by career incentives,...
Sep 8, 2025
The AI Agent Gold Rush: Why Everyone's Building Picks and Shovels
Most AI agent infrastructure is premature. The agents themselves barely work. The industry is selling Formula 1 equipment to people still learning to...
Jul 11, 2025
The CLI Renaissance: How AI is Driving the Command Line Revolution
AI coding assistants output shell commands, not GUI instructions. That single fact is reversing a decade of developer tooling trends.
Jul 8, 2025
Useful AI Code Review Needs Product Context
AI review only becomes valuable when it can reason about behavior, blast radius, user impact, and the evidence required to trust a change.
Jul 8, 2025
Async Code Gen Turns Engineers Into Operators
Async code generation is delegated execution. The new work is task design, review, evidence, and deciding what the system is allowed to ship.
Jul 8, 2025
The Death of the 10x Developer: Why AI Multiplication Beats Individual Optimization
AI commoditized the pattern recognition and architectural intuition that made 10x developers valuable. The bottleneck moved from individual output to...
Jun 28, 2025
Testing at Light Speed: How QA Adapts to AI Velocity
AI-generated code produces different bugs than human-written code. QA built for syntax checking is testing for the wrong failures.
Jun 26, 2025
Forget Perfect Data: Building a Usable Voice Profile Extractor
I shipped a voice profile extractor at 60% accuracy. Simple pattern matching outperformed ML for writing voice replication.
Jun 25, 2025
When Claude Hits Its Limits: Building an AI-to-AI Escalation System
Different LLMs have different strengths. Routing tasks to the right model -- like heterogeneous compute -- turns out to be more valuable than using one ...
Jun 25, 2025
What Actually Failed Building a Multi-AI Content System
I built a multi-AI content pipeline combining Gemini and Claude. The failures taught me more than the architecture.
Jun 25, 2025
Scaling the Me Component: How I Built an AI That Thinks Like Me
I built a voice replication system by extracting patterns from my blog corpus. Here's what it captures, what it misses, and what that reveals about...
Jun 25, 2025
Prompts Are Software. Treat Them Like It.
Production AI teams do not win by hand-tuning clever prompts. They version, evaluate, optimize, and observe behavior like software.
Jun 25, 2025
How I Built a Security Scanner That Actually Finds Bugs
Combining Semgrep, CodeQL, SonarQube, and Snyk gets you 44.7% vulnerability detection. Semantic SAST combines Tree-sitter with LLM reasoning to do better.
Jun 25, 2025
Shared Context Is the Real Multi-Agent Primitive
Multiple agents do not need a shared brain. They need explicit context, durable memory, and a record of why the project works the way it does.
Jun 20, 2025
Building for Humans AND Machines: The Dual-Audience Problem
Every web design decision now must serve two audiences: humans who browse visually and AI agents that consume data programmatically. The architectural...
Jun 19, 2025
When AI Learns to Write Like You: A Meta-Analysis
I asked Claude to analyze my writing style across my blog posts. The patterns it found -- and the ones I didn't know I had -- were genuinely surprising.
May 26, 2025
OCode: Why I Built My Own Claude Code (and Why You Might Too)
A few nights ago, I opened my Anthropic invoice.
May 3, 2025
Agent Infrastructure Starts With Identity, Policy, and Audit
Autonomous agents need a control plane: identity, policy, secrets, and audit trails that make delegated work governable.
Apr 17, 2025
AI Detection Hysteria: When Human Creativity Gets Mislabeled
A photographer friend posted a sunset photo after three hours of waiting for the perfect light. Within minutes: 'Obvious Midjourney.' 'Nice prompt, bro.'
Jan 7, 2025
The AI Skill Mirror: Why Technical Interviews Need a Complete Rewrite
AI doesn't make everyone equally skilled. It amplifies existing ability. That changes what technical interviews should test.
Jan 6, 2025
How RAG Actually Works: Architecture Patterns That Scale
Deep dive into RAG architectures: chunking strategies, retrieval methods, embedding optimization, and production patterns with research-backed analysis.
Jan 6, 2025
Prompt Engineering Science: I Tested Temperature and Top-P on 1000 Queries
Systematic experiments on temperature and top-p sampling parameters across 1000 real queries with empirical data on creativity, coherence, and...
Apr 11, 2024
When the AI Starts Complimenting You Too Much: A Troubling First for ChatGPT
OpenAI recently rolled back a GPT-4 update due to sycophantic behavior. The word itself--'sycophantic'--feels like a punchline from a _Black Mirror_...
Apr 11, 2024
AI Expectations: Managing the Hype Cycle
Most AI products are designed to fail. Not because the technology is bad, but because product teams are building for the wrong expectations entirely.
Apr 11, 2024
Chrome Extension for Jira Titles: A Developer's Journey
I kept writing terrible Jira titles during customer calls. So I built a Chrome extension to fix it.
