Autonomous AI coding doesn't fail because models aren't smart enough. It fails because we give them too much context, vague goals, and no hard definition of success. The Ralph Wiggum approach flips that on its head. Short contexts, brutally clear tasks, hard completion signals, and relentless...
The next AI breakthrough won't be smarter models but reliable ones. As with self-driving cars, progress in AI agents means consistency over demos. The future of AI lies in disciplined software engineering: building agents that work safely and predictably every time, not just sometimes.
Large language models (LLMs) and AI agents that use them often get lauded as magic. But anyone using them in production or serious applications quickly learns how often they fail. What we commonly call hallucinations, workslop, or vulnerabilities are not random bugs. They tend to cluster into...
There’s been a lot of recent buzz around whether Retrieval-Augmented Generation (RAG) has reached its limits. Is RAG truly dead, or just replaced by new approaches like search agents, MCP, or massive context windows? Those approaches are still retrieval-augmented generation under a new name.
Most AI models aren't generalists; they're specialists. With over 200,000 LLMs available, choosing just one model for your AI product won't cut it. Enter LLM routers: technology that routes each task to the model best suited to handle it. Discover how routers can cut costs by 85%, improve spe...
AI search apps like Perplexity.ai are really cool. They use an LLM to answer your questions, but pull in real-time search results to augment the answer (e.g., RAG) and list citations. I wanted to know how they work and decided to build my own version. Getting it to work is surprisingly simple and...
On a cold evening in Paris in 2008, Travis Kalanick and Garrett Camp couldn't get a cab. Fast forward to today, and Uber is the largest mobility platform in the world, operating in over 70 countries and 10,500 cities with 130 million users. Here's the story of the key moments in scaling Uber engineering ...
I revisited the look of early web design by recreating my home page with a 90s theme. Drawing inspiration from iconic sites like Space Jam, I incorporated classic elements such as GIF badges, pixelated fonts, and dithered images to capture the era's distinct style.
I explored using OpenAI's GPT-3 language models to generate fantasy football player outlooks for my Draft Punk app. Through experimentation with different models, retrieval-augmented generation, and prompt engineering, I learned how to create compelling, fact-based player narratives that riva...
You don't grow crops by yelling at plants, and you don't grow team velocity by pressuring engineers. Great managers prepare the ground. Give clarity, tools, and trust. Focus on support, not blame. Remove blockers, nurture growth, match people to problems, and improve yourself first.