quality-pass-20260427t180000

.quality-pass-20260427T180000

Metadata

Statusdone
Assignedagent-1
Agent identityeea940a6f6be13d60578dee27be1f4bade4fcaab05bbbe54b9c5ef4b2d05eae0
Created2026-04-28T01:47:29.392644989+00:00
Started2026-04-28T01:48:18.877641085+00:00
Completed2026-04-28T01:53:20.730012469+00:00
Tagseval-scheduled
Eval score0.81
└ blocking impact0.90
└ completeness0.85
└ coordination overhead0.80
└ correctness0.80
└ downstream usability0.80
└ efficiency0.85
└ intent fidelity0.77
└ style adherence0.75

Description

Quality Pass: Post-Triage Review

Review and optimize task metadata for the six tasks created as part of the "virtual review session" before they enter execution.

Tasks to review

  • update-tracking-docs
  • stocktake-poietic-pbc
  • independent-review-poietic
  • visual-language-study
  • engage-pr1-review
  • direction-synthesis-virtual

What to do

For EACH task listed above:

1. Classify task type

Read the task via wg show <task-id>. Classify as one of:

  • research — Investigation, analysis, surveying assets
  • implementation — New code, features
  • fix — Bug fixes, small mechanical updates
  • design — Architecture, planning, sketches
  • test — Test writing
  • docs — Documentation, written critique, synthesis
  • refactor — Restructuring without behavior change

Expected classifications (verify against task content):

  • update-tracking-docs → fix
  • stocktake-poietic-pbc → research
  • independent-review-poietic → docs (critical review writing)
  • visual-language-study → research / design
  • engage-pr1-review → docs (review writing) or research
  • direction-synthesis-virtual → docs (synthesis)

2. Assign agent identity

Run wg agency stats --by-task-type to see role performance by task type. Use the recommended role for the classified type. If "(insufficient data)", fall back to the overall Role Leaderboard.

For JSON: wg agency stats --by-task-type --json and read .task_type_breakdown.recommendations[].best_role.

Apply: wg assign <task-id> <agent-hash>

3. Select model tier

From the same stats output, check the Best Model by Task Type table.

Override heuristics when data is sparse:

SignalModel
Simple mechanical (e.g. update-tracking-docs)haiku
Standard research, docs, designsonnet
High-stakes synthesis with strong-opinion output (e.g. direction-synthesis-virtual)opus

Apply: wg edit <task-id> --model <tier>

4. Release for execution

wg publish <task-id> (the tasks are in draft/paused mode; wg publish releases them).

If wg publish does not exist, try wg resume <task-id>.

Validation

  • Every listed task has an agent assigned (verify via wg show)
  • Every listed task has a model set
  • Every listed task is released (status: open, not paused/draft)
  • update-tracking-docs is set to haiku (mechanical work)
  • direction-synthesis-virtual is set to opus (high-stakes synthesis)
  • Assignments justified by evaluation data, not arbitrary

Depends on

Required by

Log