fix-codex-cli

Fix: codex-cli route writes correct tiered defaults + per-role agency overrides

Metadata

Statusdone
Assignedagent-998
Agent identityf51439356729d112a6c404803d88015d5b44832c6c584c62b96732b63c2b0c7e
Modelclaude:sonnet
Created2026-04-28T22:38:58.152051934+00:00
Started2026-04-28T22:51:35.642610665+00:00
Completed2026-04-28T23:06:32.577282998+00:00
Tagspriority-high,fix,codex,config, eval-scheduled
Eval score0.83
└ blocking impact0.90
└ completeness0.75
└ coordination overhead0.85
└ correctness0.88
└ downstream usability0.82
└ efficiency0.82
└ intent fidelity0.94
└ style adherence0.85

Description

Description

Implement the recommendations from research-current-openai. Read that task's log via wg show research-current-openai for the verified codex model strings + tier mapping.

Goal: a user running wg init --route codex-cli (or wg init -m codex:<something>) gets a working all-codex setup — including agency meta-tasks (.evaluate-*, .flip-*, .assign-*) running on codex, not silently falling back to claude:haiku.

Scope

  1. Update the codex-cli route in wg init to write:
    • [agent].model and [dispatcher].model set to the balanced/sonnet-equivalent codex tier
    • [models.evaluator], [models.flip], [models.assigner] set to the cheap/haiku-equivalent codex tier
    • Any other per-role overrides that today default to claude:haiku
  2. If the current codex:o1-pro default is stale, replace it with the current top-tier model name from research.
  3. Add a deprecation/migration path: existing configs with the old defaults should warn (via wg config lint) and wg migrate config should rewrite them.
  4. Document the change: update the codex-cli section of the relevant docs (probably docs/config-ux-design.md or a setup guide) with the new defaults.

Validation

  • Failing test written first: wg init --route codex-cli --dry-run output contains the new [models.*] sections AND the updated [agent]/[dispatcher].model strings
  • Live smoke: in a fresh tmpdir, run wg init --route codex-cli then wg config show — confirm all four model strings are codex-prefixed (no claude: anywhere unless explicitly chosen)
  • Live smoke: run a tiny task end-to-end on this fresh project and confirm the spawned agent uses codex (check wg agents for executor=codex, OR check WG_EXECUTOR_TYPE in the spawned process env)
  • Live smoke: confirm the agency pipeline (.evaluate-* / .flip-*) also runs on codex on this project — not claude:haiku. Inspect the spawned meta-agent's executor.
  • wg config lint flags old codex configs and wg migrate config rewrites them
  • cargo build + cargo test pass with no regressions
  • Permanent smoke scenario added under tests/smoke/scenarios/ with this task id in owners
  • cargo install --path . was run before claiming done

Depends on

Required by

Log