quality-pass-20260428t214500

.quality-pass-20260428T214500

Metadata

Statusdone
Assignedagent-957
Agent identityeea940a6f6be13d60578dee27be1f4bade4fcaab05bbbe54b9c5ef4b2d05eae0
Created2026-04-28T21:54:51.413162671+00:00
Started2026-04-28T21:56:23.805479214+00:00
Completed2026-04-28T21:59:08.386248514+00:00
Tagseval-scheduled
Eval score0.93
└ blocking impact0.95
└ completeness0.95
└ coordination overhead0.93
└ correctness0.95
└ downstream usability0.92
└ efficiency0.90
└ intent fidelity0.82
└ style adherence0.88

Description

Quality Pass: Post-Triage Review

Review and optimize task metadata for newly created tasks before they enter execution.

Tasks to review

  • diagnose-tui-new
  • fix-tui-new

What to do

For EACH task listed above:

1. Classify task type

Read the task via wg show <task-id>. Classify as one of:

  • research — Investigation, analysis, library evaluation
  • implementation — New code, features, endpoints
  • fix — Bug fixes, error corrections
  • design — Architecture, API design, planning
  • test — Test writing, test infrastructure
  • docs — Documentation, comments, guides
  • refactor — Code restructuring without behavior change

(Hint: diagnose-tui-new is research; fix-tui-new is fix.)

2. Assign agent identity

Run wg agency stats --by-task-type to see role performance by task type. Use the recommended role for the task's classified type. If '(insufficient data)', fall back to the overall Role Leaderboard.

For JSON: wg agency stats --by-task-type --json.task_type_breakdown.recommendations[].best_role.

Apply: wg assign <task-id> <agent-hash>

3. Select model tier

Check the Best Model by Task Type table from the same stats command. Override heuristics:

SignalModel
Simple, mechanical, well-definedhaiku
Standard implementation, testing, researchsonnet
Complex design, multi-system reasoningopus
Task has failed beforeescalate one tier

Apply: wg edit <task-id> --model <tier>

4. Release for execution

After assigning agent and model: wg resume <task-id>

Validation

  • Both tasks have an agent assigned (check via wg show)
  • Both tasks have a model set
  • Both tasks are un-paused (status: open, not paused)
  • Assignments are justified by evaluation data, not arbitrary

Depends on

Required by

Log