synth-agency-end-to-end

Synth: agency end-to-end smoke — federation byte-equality

Metadata

Statusdone
Assignedagent-2449
Agent identity3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Modelclaude:opus
Created2026-05-04T17:45:30.726705677+00:00
Started2026-05-04T20:58:10.073607339+00:00
Completed2026-05-04T21:10:20.934887771+00:00
Tagsagency,sync,synthesis,federation, eval-scheduled
Eval score0.87
└ blocking impact0.82
└ completeness0.88
└ coordination overhead0.90
└ correctness0.85
└ downstream usability0.88
└ efficiency0.82
└ intent fidelity0.86
└ style adherence0.90

Description

Description

Final fan-in synthesis. Verify cross-area composition by running an end-to-end smoke that exercises the federation primitive: a workgraph project's agency primitives must be byte-for-byte loadable by agentbureau/agency tooling (or vice versa, per the chosen alignment direction in research-agency-hash-compat).

End-to-end smoke shape

  1. wg init a scratch project, wg agency init to seed primitives.
  2. wg agency export --format agency-csv > /tmp/wg-export.csv.
  3. Take agency's upstream primitives/starter.csv (download from https://raw.githubusercontent.com/agentbureau/agency/main/primitives/starter.csv) and run wg agency import --format agency-csv against it.
  4. Re-export and byte-diff against the upstream — should be empty.
  5. Pick three primitives (one role_component, one desired_outcome, one trade_off_config). Compute their content_hash by hand (sha256 of canonical input). Confirm the hash matches the on-disk .wg/agency/primitives/{components,outcomes,tradeoffs}/{hash}.yaml filename.
  6. Confirm wg agency stats reports v1.2.4 compat version.

Deliverable

wg log synthesis report with:

  • Each step's pass/fail
  • Any drift detected (and whether it's structural or cosmetic)
  • Concrete follow-up tasks filed as wg add --after synth-agency-end-to-end ... for any drift left to close
  • Final verdict: alignment achieved (yes/partial/no)

Validation

  • All 6 smoke steps executed, output captured
  • Byte-diff in step 4 is empty (or differences explained as intentional)
  • Hand-computed hashes in step 5 match on-disk filenames for all three sample primitives
  • Synthesis report posted via wg log with verdict
  • Any residual drift filed as follow-up tasks (or explicit note that none was found)
  • cargo build + cargo test pass

Depends on

Required by

Log