Why (Senior) Engineers Struggle to Build AI Agents — Philipp Schmid, Google DeepMind
TL;DR
Agents change the job from control to delegation — Schmid says traditional software engineers acted like traffic controllers, but with agents you’re a dispatcher who sets the destination and lets the model choose the route, even when the path looks strange.
Text is the new state layer — Instead of rigid booleans and flags, agents work on semantic context, like approving a deep research plan while adding “focus on the US market and ignore California” in one step.
Errors should be fed back into the loop, not restart it — If a 5- or 15-minute agent run fails, starting over wastes compute and loses context, so errors need to be treated like inputs the model can recover from.
Unit tests aren’t enough; reliability comes from evals — Because the same prompt can produce different outputs, the real question is not whether input A always yields C, but how often the agent succeeds well enough to trust in production.
APIs built for humans often fail agents — A method like
delete itemmay feel obvious to backend engineers, but agents only see function schemas and docstrings, so tools need explicit, self-documenting semantic interfaces.Build for replacement, not permanence — Schmid closes with the “bitter lesson” that agent software is disposable and will be rebuilt repeatedly as better models and agent patterns arrive.
The Breakdown
Senior engineers trip over AI agents because the mental model is backwards: you’re no longer a traffic controller scripting every move, you’re a dispatcher setting goals and recovering when the model takes weird-but-effective paths. Philipp Schmid from Google DeepMind lays out five concrete shifts—from treating text as state to replacing unit tests with evals—that explain why traditional software habits break down.
Was This Useful?
Share
Keep Reading
Make Alcreon Yours
Tune your feedFive quick questions, and the feed ranks what matters to you first.Or just get notified
The weekly Echo. Signal worth keeping in your inbox.
Every new piece, announced on X.
Read Next
See all
Playbook
Tasteful Skills
“Tasteful Skills” argues that the best agent skills are not documentation or best-practice lists.

Playbook
The Art of Tasteful Prompting
Learn how tasteful prompting helps you move beyond generic AI output by shaping context, style, and judgment from the start.

Playbook
The Codex /goal Playbook
OpenAI shipped /goal for the Codex CLI. It turns a prompt into a persisted, self-continuing contract.