OpenAI Codex Worktrees and Parallel Agents Playbook

Worktrees are one of the most important Codex desktop concepts because they turn agent parallelism from chaos into reviewable isolation. Without worktrees, multiple agents editing the same checkout can collide with each other and with the developer’s own changes. With worktrees, each task can run in its own Git-backed copy while the main working tree stays usable.

The point is not to run the maximum number of agents. The point is to run independent work in isolation, then compare the results with normal engineering discipline.

Quick answer

Use Codex worktrees when a task is independent, likely to modify files, or long enough that you do not want it touching your active checkout. Keep the first rollout conservative: two to four parallel agents are usually enough until the team knows its review capacity. More agents create more diffs, more tests, more follow-up decisions, and more cleanup.

When a worktree is worth it

Use a worktree for:

multi-file features;
refactors;
dependency migrations;
UI changes that may need repeated attempts;
bug fixes where you want to preserve your local state;
alternative implementation experiments;
scheduled automations that might edit files;
subagent exploration where each agent owns a different question.

Do not use a worktree just because it sounds sophisticated. For a one-line comment or a small local test change, the local checkout may be simpler.

The right task shape

A good worktree task has:

Element	Example
Objective	”Add CSV export for billing reports”
Boundary	”Stay within reports, billing, and tests unless blocked”
Verification	”Run npm test — billing and npm run typecheck”
Evidence	”Report commands, failures, files changed, and migration risk”
Review path	”Do not commit; leave a clean diff for review”

The worktree does not remove the need for scope. It just gives the agent a safer place to work.

Parallel agent patterns

One implementation, one reviewer agent

Use one Codex thread to implement and a second read-only or review-focused thread to inspect the diff.

Good for:

production fixes;
complex refactors;
security-sensitive changes;
migrations where regressions are easy to miss.

Prompt shape:

Agent A: implement the smallest fix for the failing auth refresh test.
Agent B: independently review Agent A's eventual diff for behavior changes,
security issues, missing tests, and migration risk. Do not edit files.

Multiple alternative implementations

Use two agents when the design choice is unclear.

Good for:

UI architecture choices;
data model migrations;
package replacement decisions;
performance fixes with competing tradeoffs.

The reviewer should compare:

diff size;
test coverage;
migration risk;
maintainability;
performance evidence;
rollback complexity.

Do not merge both. The value is in choosing one stronger path.

Parallel exploration

Use several agents for read-heavy investigation when the problem has independent dimensions:

one agent maps the frontend path;
one maps the API path;
one maps tests and fixtures;
one looks for prior incidents or TODOs.

Then consolidate before any implementation begins. This reduces context pollution and helps the main agent work from sharper evidence.

Review capacity limit

Parallelism is bounded by human review, not model throughput.

Use this simple capacity rule:

Team state	Recommended active write agents per repo
New to Codex	1
Comfortable with reviewable diffs	2
Strong tests and named reviewers	3 to 4
Dedicated developer productivity workflow	More, only with queue policy

If reviewers are already behind, more agents make the system worse. The bottleneck moves from typing code to deciding whether code is safe.

Worktree cleanup

Worktrees consume disk space because each one can have repo files, dependencies, build output, and caches. The official Codex worktrees docs note that Codex manages worktrees and keeps a limited number by default, but teams should still treat cleanup as an operating habit.

Cleanup policy:

Archive threads that are no longer useful.
Pin only worktrees with unresolved value.
Delete failed explorations after extracting learning.
Avoid leaving dependency caches in many stale worktrees.
Write a short decision note before deleting an alternative path.

The goal is not perfect cleanliness. The goal is to avoid an invisible backlog of half-decisions.

Handoff between local and worktree

Codex can move threads between local and worktree modes. Use this intentionally:

start in a worktree when the agent may explore;
hand back to local only after the direction is accepted;
avoid handoff when your local checkout has uncommitted changes that conflict with the agent’s work;
rerun verification after handoff because environment state can differ.

Treat handoff as a merge step, not a magic teleport.

Risks and controls

Risk	Control
Two agents change the same file differently	Assign file ownership in the prompt
Reviewer cannot understand diff intent	Require plan and summary before final review
Worktree uses stale dependencies	Run setup or local environment scripts explicitly
Automation edits active work	Use background worktrees for recurring changes
Too many stale worktrees	Archive or delete after decision
Subagents spend tokens without value	Use subagents only for independent questions

Codex automations playbook Automations can run on worktrees, but only after the task has a durable prompt and safe review loop.

Codex sandboxing and approvals Worktree isolation is not the same thing as full security isolation.

Codex PR gates and evals Use review gates to decide which worktree output should move toward merge.

Source notes

This playbook is based on OpenAI’s Codex worktrees documentation, Codex app features, and subagents documentation.

OpenAI Codex Worktrees and Parallel Agents Playbook

OpenAI Codex Worktrees and Parallel Agents Playbook

Quick answer

When a worktree is worth it

The right task shape

Parallel agent patterns

One implementation, one reviewer agent

Multiple alternative implementations

Parallel exploration

Review capacity limit

Worktree cleanup

Handoff between local and worktree

Risks and controls

Related paths

Source notes