Agent Readiness

    Agents are an environment problem, not a hiring problem. The quality of agent output is bounded by the environment the agent operates in—what it can see, what it can do, how it knows whether it worked, and what survives between sessions. Treating an agent as a new employee to be onboarded with documentation leads to a predictable dead end. Treating it as a system to be architected with context, tools, and verification leads somewhere productive.

    This reframe matters because it relocates effort. The common instinct when an agent underperforms is to communicate harder: write a longer prompt, add another markdown file, build another skill. That instinct produces three recurring artifacts that signal the wrong knob is being turned—prompt perfectionism (endlessly tuning wording), docs graveyards (large .md collections that go stale faster than the code they describe), and skills graveyards (dozens of single-use capabilities that accumulate as cruft). All three are symptoms of the same error: trying to communicate harder instead of removing the need to communicate.

    The alternative is to engineer an environment where the agent does not need to be told. A repository that verifies any change in seconds, describes its own setup, mechanizes its standards, and bounds the blast radius of mistakes does not require a perfect prompt—it makes the right action the easy action. This chapter develops that shift into a working vocabulary, a set of principles, and diagnostic ladders.


    The Reframe

    The conventional question and the productive question differ by one word, and that word changes everything downstream.

    Conventional framing Readiness framing
    "How do I tell the agent more clearly?" "How do I build an environment where it doesn't need to be told?"
    Agent quality is a property of the prompt Agent quality is a property of the surrounding surfaces
    Failures mean the instructions were unclear Failures map to specific, namable environment gaps
    Standards live in a style guide humans read Standards live in checks that fail builds
    Progress means more documentation Progress means more verification and better tools

    The conversion moment for most practitioners is experiential rather than argued. The typical version is some variation of having spent many hours building a docs/ directory that did not help. Once that cost has been paid on the wrong knob, the reframe lands immediately; before it, the reframe tends to bounce.


    Sections

    Section Description
    The Four Surfaces Context, tools, verification, and memory as the surfaces that determine agent quality, plus the leverage hierarchy from prompt phrasing to cross-session memory
    Failure Modes A shared triage vocabulary—context starvation, context poisoning, verification gap, tool poverty, spec drift, lossy handoff—mapped to root surfaces and fixes
    Readiness Principles Eight principles of an agent-ready environment, plus a compact five-point frame that captures them
    Dimensions of Readiness Readiness resolved into independent dimensions—individual practice, the environment, and the organization—for locating where it actually stands
    Ratcheting and Legacy How brownfield repositories become agent-ready by improving monotonically rather than by rewriting
    Software Ecology How readiness scales from one repository to the whole socio-technical ecosystem, and what breaks across the delivery pipeline at 10x agent activity

    Connections

    • Context: Context is the first of the four surfaces. The starvation and poisoning failure modes are context failures, and context curation sits near the base of the leverage hierarchy.
    • Tool Use: Tools are the surface that determines what an agent can act on. Tool poverty—no skill for a repeated task, no integration for the upstream system—is a readiness gap, not a model limitation.
    • Harnesses: The harness is where verification loops, permissions, and blast-radius controls are encoded. An agent-ready environment is largely a well-engineered harness.
    • Practices: Debugging agents depends on naming failure modes precisely; the readiness vocabulary feeds directly into triage.
    • Long-Horizon Agent State: Memory is the fourth surface and the highest tier of leverage. What survives across sessions is the subject of the next chapter.