quality-pass-20260412t184600

.quality-pass-20260412T184600

Metadata

Statusdone
Assignedagent-399
Agent identity3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Created2026-04-12T18:46:35.251442108+00:00
Started2026-04-12T18:53:35.476484728+00:00
Completed2026-04-12T18:53:38.103822039+00:00
Tagseval-scheduled
Eval score0.83
└ blocking impact0.80
└ completeness0.90
└ coordination overhead0.85
└ correctness0.85
└ downstream usability0.80
└ efficiency0.80
└ intent fidelity0.81
└ style adherence0.85

Description

Quality Pass: Post-Triage Review

Review and optimize task metadata for newly created tasks before they enter execution.

Tasks to review

  • review-and-merge
  • fix-terminology-single

What to do

For EACH task listed above:

1. Classify task type

Read the task via wg show <task-id>. Classify as one of:

  • research — Investigation, analysis, library evaluation
  • implementation — New code, features, endpoints
  • fix — Bug fixes, error corrections
  • design — Architecture, API design, planning
  • test — Test writing, test infrastructure
  • docs — Documentation, comments, guides
  • refactor — Code restructuring without behavior change

2. Assign agent identity

Run wg agency stats --by-task-type to see role performance by task type. The Recommendations by Task Type table shows the best role for each type.

Use the recommended role for the task's classified type. If the recommendation says '(insufficient data)', fall back to the overall Role Leaderboard.

For JSON access (machine-readable): wg agency stats --by-task-type --json Look at .task_type_breakdown.recommendations[].best_role.

Apply: wg assign <task-id> <agent-hash>

3. Select model tier

Run wg agency stats --by-task-type and check the Best Model by Task Type table. Use the top-scoring model for the task's classified type.

For JSON access: .task_type_breakdown.recommendations[].best_model.

Override heuristics (when evaluation data is sparse or absent):

SignalModel
Simple, mechanical, well-defined (e.g., 'add a flag', 'rename X')haiku
Standard implementation, testing, researchsonnet
Complex design, multi-system reasoning, novel architectureopus
Task has failed before (check status history)escalate one tier

Apply: wg edit <task-id> --model <tier>

4. Release for execution

After assigning agent and model: wg resume <task-id>

Validation

  • Every listed task has an agent assigned (check via wg show)
  • Every listed task has a model set
  • Every listed task is un-paused (status: open, not paused)
  • Assignments are justified by evaluation data, not arbitrary

Depends on

Required by

Log