codeflash-agent/docs/design.md
Kevin Turcios 3b59d97647 squash
2026-04-13 14:12:17 -05:00


1. Treat the harness as first-class product IP

The harness, meaning the orchestrator around the model, is the product. Invest in:

  • context selection
  • task planning
  • tool descriptions
  • retries and recovery
  • permission policies
  • durable state and memory
  • evaluation loops

2. Long-running agents need explicit state management

If an agent will span many turns or run in the background, it cannot rely on raw transcript accumulation. It needs:

  • compact task state
  • durable artifacts and handoff files
  • summarized history
  • selective retrieval of only relevant prior work

3. Safety needs multiple layers

The practical stack is not one feature. It is a combination of:

  • conservative defaults
  • scoped permissions
  • sandboxing where possible
  • action classification
  • audit logs
  • destructive-action testing
  • prompt-injection defenses
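Several of these layers compose naturally in code. A minimal Python sketch of conservative defaults plus action classification; the action names and risk levels here are hypothetical, not a real policy:

```python
# Hypothetical risk classification for agent actions.
RISK_LEVELS = {
    "read_file": "safe",
    "run_tests": "safe",
    "git_push": "confirm",   # externally visible: ask first
    "rm_rf": "deny",         # destructive: never auto-approve
}

def gate(action: str, user_confirmed: bool = False) -> bool:
    # Conservative default: unknown actions require confirmation.
    level = RISK_LEVELS.get(action, "confirm")
    if level == "deny":
        return False
    if level == "confirm":
        return user_confirmed
    return True
```

Deny always wins, and anything unclassified falls back to "confirm" rather than "safe", which is the conservative-defaults layer in practice.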

4. Local agents create real endpoint risk

A coding agent with shell and filesystem access is effectively privileged software. That means release hygiene matters:

  • do not ship source maps in production artifacts
  • scan release bundles before publish
  • use artifact signing / attestation
  • minimize local plaintext retention where possible
  • document what is logged, where, and why
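The first two checks are easy to automate. A minimal Python sketch that scans a release directory for files that should not ship; the deny-listed suffixes are illustrative assumptions, not a complete policy:

```python
from pathlib import Path

# Illustrative deny-list; extend for your own release policy.
BANNED_SUFFIXES = {".map", ".pem"}

def scan_release_bundle(dist_dir: str) -> list[str]:
    """Return paths of files that should not ship in a production artifact."""
    findings = [
        str(p) for p in Path(dist_dir).rglob("*")
        if p.is_file() and p.suffix in BANNED_SUFFIXES
    ]
    return sorted(findings)
```

Run it in CI before publish and fail the build when the list is non-empty.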

How to Be Effective with Context Engineering

Anthropic defines context engineering as curating and maintaining the right set of tokens and state around a model invocation, not just writing a better prompt. For an agentic CLI, the practical meaning is simpler: the system should always provide the model with enough context to take the next correct action, but not so much that it becomes distracted, expensive, or unsafe.

A more useful working definition

For a coding agent, context is not just the system prompt. It is the full operating environment:

  • the active task and constraints
  • the current plan and stopping condition
  • the relevant files, symbols, and diffs
  • the available tools and their contracts
  • the recent observations from shell commands and tests
  • durable memory from earlier work
  • the policy boundary around permissions and risky actions

If any of those are missing, stale, or too noisy, agent quality drops fast.

The context stack a coding CLI should manage

Treat context as a layered stack, not a single blob:

  1. Stable policy layer
    The non-negotiables: system rules, tool permissions, repo conventions, sandbox limits, output style, and safety constraints.

  2. Task layer
    The user's request, the success condition, assumptions, and explicit non-goals. This should be short and durable.

  3. Working-state layer
    The current plan, what has already been tried, what remains blocked, and which files or services are in scope.

  4. Evidence layer
    The actual code snippets, command results, test failures, stack traces, and docs needed for the next decision.

  5. Memory layer
    Reusable facts worth carrying across turns, such as build quirks, repo-specific commands, and previous failed approaches.

Most agent failures happen when these layers are mixed together without discipline.
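The layered stack above can be sketched as a small data structure that renders layers in stability order and bounds the volatile evidence layer; field names and the evidence budget are illustrative:

```python
from dataclasses import dataclass, field

@dataclass
class ContextStack:
    # Layers ordered from most stable to most volatile.
    policy: str = ""         # 1. non-negotiable rules and permissions
    task: str = ""           # 2. objective, success condition, non-goals
    working_state: str = ""  # 3. current plan, what has been tried
    evidence: list[str] = field(default_factory=list)  # 4. snippets, failures
    memory: list[str] = field(default_factory=list)    # 5. durable repo facts

    def render(self, evidence_budget: int = 2) -> str:
        # Evidence is the most volatile layer: keep only the newest items.
        parts = [self.policy, self.task, self.working_state]
        parts += self.memory + self.evidence[-evidence_budget:]
        return "\n\n".join(p for p in parts if p)
```

Keeping the layers as separate fields, rather than one string, is what makes disciplined pruning possible later.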

Opinionated rules for agent and CLI design

1. Keep the task state outside the transcript

Do not rely on the model to infer the current plan from chat history. Persist a compact state object or artifact containing:

  • the objective
  • current step
  • files in scope
  • known constraints
  • open questions
  • last meaningful result

The transcript is a bad database. Use it for conversation, not state recovery.
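A compact state object along these lines, persisted outside the transcript as a JSON artifact; the field names mirror the list above, and the file path is up to the caller:

```python
import json
from dataclasses import asdict, dataclass, field
from pathlib import Path

@dataclass
class TaskState:
    objective: str
    current_step: str = ""
    files_in_scope: list[str] = field(default_factory=list)
    constraints: list[str] = field(default_factory=list)
    open_questions: list[str] = field(default_factory=list)
    last_result: str = ""

    def save(self, path: str) -> None:
        # The artifact, not the transcript, is the source of truth.
        Path(path).write_text(json.dumps(asdict(self), indent=2))

    @classmethod
    def load(cls, path: str) -> "TaskState":
        return cls(**json.loads(Path(path).read_text()))
```

On resume, the agent loads this object instead of replaying chat history.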

2. Retrieve code narrowly and late

Do not dump entire files or directories into context by default. Retrieve only what the next step needs:

  • a specific symbol
  • a failing test
  • a diff hunk
  • a bounded file region
  • a targeted doc excerpt

Broad retrieval creates distraction and raises token cost without improving decisions.
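Bounded retrieval can be as simple as windowing a file around the point of interest, for example a failing assertion line; the default radius is an illustrative choice:

```python
def read_region(text: str, center_line: int, radius: int = 5) -> str:
    """Return a bounded window of lines around a zero-indexed line of
    interest, instead of the whole file."""
    lines = text.splitlines()
    lo = max(0, center_line - radius)
    hi = min(len(lines), center_line + radius + 1)
    return "\n".join(lines[lo:hi])
```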

3. Summarize after every expensive step

After a search pass, test run, or multi-command investigation, convert the result into a short structured summary before moving on. Good summaries should capture:

  • what was learned
  • what changed
  • what remains uncertain
  • what the next action should be

This keeps the working set fresh and prevents context drift across long sessions.
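One way to force that discipline is to make the summary a fixed structure rather than free text. A sketch, with fields mirroring the four points above:

```python
from dataclasses import dataclass

@dataclass
class StepSummary:
    learned: str      # what was learned
    changed: str      # what changed
    uncertain: str    # what remains uncertain
    next_action: str  # what the next action should be

    def render(self) -> str:
        # Fixed labels keep summaries scannable across many steps.
        return (f"LEARNED: {self.learned}\nCHANGED: {self.changed}\n"
                f"UNCERTAIN: {self.uncertain}\nNEXT: {self.next_action}")
```

After an expensive step, the raw output is replaced in context by `render()`'s few lines.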

4. Design tools to return decision-ready output

Tool output should help the model choose the next action, not force it to parse noise. Prefer:

  • concise command output
  • bounded file reads
  • explicit exit codes
  • normalized error messages
  • machine-parseable fields where possible

If a tool returns pages of raw text, the tool is poorly designed for agent use.
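A sketch of normalizing raw command output into a decision-ready result, with bounded stdout, an explicit exit code, and a normalized error field; the truncation policy here is an assumption:

```python
from dataclasses import dataclass

@dataclass
class ToolResult:
    exit_code: int
    stdout: str
    error: str = ""

def normalize(exit_code: int, raw_output: str, max_lines: int = 20) -> ToolResult:
    # Bound the output so the model sees a decision-ready tail, not pages of noise.
    lines = raw_output.splitlines()
    if len(lines) > max_lines:
        kept = lines[-max_lines:]
        kept.insert(0, f"[... {len(lines) - max_lines} lines truncated ...]")
        lines = kept
    error = "" if exit_code == 0 else (lines[-1] if lines else "unknown failure")
    return ToolResult(exit_code=exit_code, stdout="\n".join(lines), error=error)
```

Keeping the tail (rather than the head) is deliberate: test runners and compilers usually put the decisive failure last.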

5. Make memory write-worthy, not chatty

Persistent memory should be rare and high-value. Store only facts that are likely to matter later, such as:

  • the right test command for this repo
  • a non-obvious setup requirement
  • a dangerous directory or workflow to avoid
  • a service dependency that causes common failures

Do not store transient observations that belong in the current task state only.
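A simple way to enforce write-worthiness is to make every memory write carry a justification, and refuse writes that have none. A sketch, not a real memory API:

```python
class Memory:
    """Append-only store where every write must justify its durability."""

    def __init__(self) -> None:
        self.facts: list[dict] = []

    def write(self, fact: str, reason: str) -> bool:
        # Refuse chatty writes: a fact with no durable reason is not stored.
        if not reason.strip():
            return False
        self.facts.append({"fact": fact, "reason": reason})
        return True
```

The stored reason also pays off at read time: it tells the retrieval layer why the fact mattered.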

6. Separate planning context from execution context

The model needs different context when deciding what to do than when editing a file or running a command. A good CLI can tighten the context window for execution:

  • include only the target file and local constraints for edits
  • include only the exact command intent and safety policy for shell execution
  • include only the relevant failure output for debugging

This reduces accidental spillover from stale earlier reasoning.
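Tightening the window can be a simple projection of the full context onto a per-phase view; the phase names and field lists here are illustrative:

```python
# Per-phase views of the full context (fields are illustrative).
PHASE_VIEWS = {
    "plan":  ["task", "working_state", "memory", "evidence"],
    "edit":  ["task", "target_file", "constraints"],
    "shell": ["command_intent", "safety_policy"],
    "debug": ["task", "failure_output"],
}

def context_for(phase: str, full_context: dict) -> dict:
    """Project the full context down to only the fields a phase needs."""
    return {k: full_context[k] for k in PHASE_VIEWS[phase] if k in full_context}
```

An edit call built this way physically cannot see stale planning chatter, because the projection never includes it.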

7. Build explicit stop conditions

Agents burn time when they do not know when to stop. Every substantial task should carry one of these end states:

  • requested change implemented
  • tests passing or best-available verification complete
  • blocked on missing permission or missing information
  • unsafe to continue without user confirmation

Without a stop condition, context engineering degrades into aimless looping.
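These end states can be encoded explicitly so the loop checks them after every action. A sketch; the precedence ordering is an assumption:

```python
from enum import Enum

class StopReason(Enum):
    DONE = "requested change implemented"
    VERIFIED = "tests passing or best-available verification complete"
    BLOCKED = "blocked on missing permission or information"
    UNSAFE = "unsafe to continue without user confirmation"

def check_stop(implemented, verified, blocked, risky):
    # Assumed precedence: safety and blocks preempt success.
    if risky:
        return StopReason.UNSAFE
    if blocked:
        return StopReason.BLOCKED
    if implemented and verified:
        return StopReason.VERIFIED
    if implemented:
        return StopReason.DONE
    return None  # keep working
```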

Common failure modes to design against

These are the recurring context failures in coding agents:

  • Context poisoning: irrelevant logs, stale plans, or old diffs dominate the prompt.
  • Context starvation: the model is asked to act without the relevant file region, command result, or policy detail.
  • Context collision: instructions from different phases conflict, such as planning guidance leaking into final output formatting.
  • Context amnesia: the agent forgets prior discoveries because nothing durable was written down.
  • Context bloat: every turn carries too much history, so quality drops and latency rises.

Your CLI should have explicit mechanisms to detect and correct each of these.
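Two of these failure modes, bloat and poisoning, are cheap to detect mechanically. A sketch; the token budget and staleness threshold are illustrative:

```python
def audit_context(items, token_budget=8000, max_age_steps=5):
    """Each item is a dict like {'tokens': int, 'age': int} describing one
    piece of context currently in the prompt."""
    problems = []
    # Bloat: the whole working set exceeds the budget.
    if sum(i["tokens"] for i in items) > token_budget:
        problems.append("bloat")
    # Poisoning: an item has survived too many steps without refresh.
    if any(i["age"] > max_age_steps for i in items):
        problems.append("poisoning")
    return problems
```

Run the audit before each model call and trigger summarization or pruning when it returns anything.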

A tactical operating loop

For a coding agent, a strong default loop looks like this:

  1. Restate the goal and define success.
  2. Gather only the minimum code and repo context needed to choose the next step.
  3. Write or update compact task state.
  4. Execute one meaningful action.
  5. Summarize the result into durable working state.
  6. Prune stale context before the next step.
  7. Stop as soon as the success condition or block condition is reached.

This is the operational core behind most reliable agent behavior.
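The seven steps above map directly onto a driver loop; the callbacks stand in for real gathering, acting, and summarizing logic, so everything here is a sketch rather than a real agent API:

```python
def run_task(goal, gather, act, summarize, prune, stopped, max_steps=10):
    """Driver for the seven-step loop; callers supply the callbacks."""
    state = {"goal": goal, "history": []}           # 1. restate the goal
    for _ in range(max_steps):
        state["evidence"] = gather(state)           # 2. minimum context
        # 3. compact task state lives in `state`, not the transcript
        result = act(state)                         # 4. one meaningful action
        state["history"].append(summarize(result))  # 5. durable summary
        prune(state)                                # 6. drop stale context
        if stopped(state):                          # 7. stop at success/block
            break
    return state
```

The `max_steps` bound is a backstop: even with explicit stop conditions, a long-running agent should never loop unbounded.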

What the Claude Code leak suggests here

The leak matters because it reinforces that strong coding agents are mostly a context-management problem wrapped around a model:

  • permission logic is context engineering
  • tool orchestration is context engineering
  • background execution is context engineering
  • memory and handoff artifacts are context engineering
  • safety boundaries are context engineering

That is the practical takeaway: do not hunt for a magic prompt. Build a system that keeps the right context available at the right time.

Practical Takeaways

If the goal is to design a strong agentic CLI, the combined lesson is:

  • Do not over-focus on prompt wording.
  • Invest in context assembly, memory, tool quality, and evaluations.
  • Keep the architecture simple until complexity is justified.
  • Treat local execution and packaging as security-sensitive.
  • Treat context as core infrastructure, not support work.

Sources