audit-sweepga-paf-filter-identity-scoring

Audit sweepGA PAF filtering scoring for chunk identity

Metadata

Status: done
Assigned: agent-2665
Agent identity: 3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Created: 2026-06-22T16:05:02.264128998+00:00
Started: 2026-06-22T16:08:21.817771393+00:00
Completed: 2026-06-22T16:13:03.871457168+00:00
Tags: sweepga, paf, validation, scoring, eval-scheduled
Eval score: 0.72
└ hallucination rate: 0.30
└ requirement coverage: 0.90
└ semantic match: 0.55
└ specificity match: 0.85

Description

Problem: For chopped PAF sensitivity, sweepGA must filter local chunks by per-chunk identity/ANI, not by length, matches, log-length-ANI, or scaffolded/merged context. The current commands used --num-mappings 1:1 --scaffold-jump 0, but nothing yet demonstrates that this combination produces identity-only scoring.
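To make the distinction concrete, the sketch below ranks two hypothetical chunks under four ranking rules. The formulas are illustrative assumptions named after the scoring modes listed above. They are not taken from sweepGA's source, which this task still has to inspect.

```python
import math

# Two hypothetical chunks as (matches, block_length), i.e. PAF columns 10 and 11.
chunks = {
    "long_low_identity": (9000, 10000),   # identity 0.90
    "short_high_identity": (990, 1000),   # identity 0.99
}

def identity(matches, block_len):
    """Per-chunk identity as matches divided by alignment block length."""
    return matches / block_len

# ASSUMED scoring definitions, for illustration only.
scorers = {
    "length": lambda m, n: n,
    "matches": lambda m, n: m,
    "log-length-ani": lambda m, n: math.log(n) * identity(m, n),
    "ani": lambda m, n: identity(m, n),
}

def winner(score):
    """Return the name of the chunk with the highest score."""
    return max(chunks, key=lambda name: score(*chunks[name]))

winners = {mode: winner(score) for mode, score in scorers.items()}
for mode, name in winners.items():
    print(f"{mode}\t{name}")
```

Under these assumed definitions only the pure identity rule prefers the short chunk, which is the behavior the audit needs to confirm or refute for the real tool.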

Task:

  • Inspect /home/erikg/.cargo/bin/sweepga --help and, if source is available locally, inspect sweepGA PAF filtering/scoring implementation.
  • Determine the exact command flags needed for per-chunk identity filtering. Candidate flags include --scoring ani, --scaffold-jump 0, and avoiding any minimum-length or adaptive-scaffold behavior that would change chunk-level interpretation.
  • Create synthetic PAF fixtures with equal and unequal lengths, matches, identities, overlapping query/target intervals, and repeated target choices. Run sweepGA PAF filtering on them to empirically verify the selected scoring chooses higher identity over longer/lower-identity blocks.
  • Confirm which fields sweepGA uses to compute identity: only PAF col10/col11 (residue matches / alignment block length), optional divergence tags such as de/dv, or other fields.
  • Produce a minimal recommended command for validated chunk filtering.
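The fixture step in the task list could be sketched roughly as follows. It builds 12-column PAF records for gapless alignments and computes the identity-only choice that a correct filter would be expected to keep. All query and target names, lengths, and the helpers `paf_line`, `paf_identity`, and `best_by_identity` are hypothetical. Running sweepGA itself is left out because the exact flags still need to be confirmed from `--help`.

```python
def paf_line(qname, qstart, qend, tname, tstart, matches,
             qlen=20000, tlen=20000, strand="+", mapq=60):
    """Build one 12-column PAF record, assuming a gapless alignment."""
    block_len = qend - qstart
    fields = [qname, qlen, qstart, qend, strand,
              tname, tlen, tstart, tstart + block_len,
              matches, block_len, mapq]
    return "\t".join(str(f) for f in fields)

fixtures = [
    # Length conflicts with identity: overlapping query interval, different targets.
    paf_line("chunk1", 0, 10000, "tA", 0, 9000),     # identity 0.90
    paf_line("chunk1", 0, 1000, "tB", 0, 990),       # identity 0.99
    # Equal lengths, unequal identity, same target chosen twice.
    paf_line("chunk2", 0, 5000, "tA", 5000, 4750),   # identity 0.95
    paf_line("chunk2", 0, 5000, "tA", 12000, 4950),  # identity 0.99
]

def paf_identity(line):
    """Identity from PAF col10 (matches) over col11 (block length)."""
    f = line.split("\t")
    return int(f[9]) / int(f[10])

def best_by_identity(lines):
    """Expected survivor per query if ranking uses identity alone."""
    best = {}
    for line in lines:
        q = line.split("\t")[0]
        if q not in best or paf_identity(line) > paf_identity(best[q]):
            best[q] = line
    return best

expected = best_by_identity(fixtures)
with open("fixtures.paf", "w") as fh:
    fh.write("\n".join(fixtures) + "\n")
```

Comparing `expected` against the records sweepGA actually retains from `fixtures.paf` gives the empirical check; the fixtures carry no de/dv tags, so a separate set with tags would be needed to test whether those fields influence the result.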

Acceptance:

  • Report gives a direct yes/no answer to two questions: does default sweepGA PAF filtering rank by a length-weighted score, and does --scoring ani rank by per-chunk identity?
  • Synthetic tests demonstrate the chosen command retains the higher-identity chunk when length conflicts with identity.
  • Recommended command includes all necessary flags and explicitly disables scaffolding/merging.
  • Results written to SWEEPGA_PAF_FILTER_IDENTITY_AUDIT.md and a TSV summary.
  • Commit and push with WG provenance.
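One possible shape for the TSV summary named in the acceptance list is sketched below. The column names and the `summarize` and `to_tsv` helpers are hypothetical, and the sample `observed` values are placeholders that would come from parsing sweepGA's actual output.

```python
import csv
import io

COLUMNS = ["fixture", "expected_kept", "observed_kept", "pass"]

def summarize(expected, observed):
    """Compare the expected survivor per fixture with what the filter kept."""
    rows = []
    for fixture, want in sorted(expected.items()):
        got = observed.get(fixture, "")
        rows.append({
            "fixture": fixture,
            "expected_kept": want,
            "observed_kept": got,
            "pass": "yes" if got == want else "no",
        })
    return rows

def to_tsv(rows):
    """Serialize summary rows as tab-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS,
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()

# Placeholder data: target names stand in for the retained record.
tsv_text = to_tsv(summarize({"chunk1": "tB", "chunk2": "tA"},
                            {"chunk1": "tB", "chunk2": "tA"}))
print(tsv_text, end="")
```

A single `pass` column keeps the yes/no answers required by the acceptance criteria easy to scan per fixture.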

Depends on

Required by

Log