fig5-f32-query-grid-chop-filter-rerun

Fig5 f32 query-grid chop/filter rerun

Metadata

Status: done
Assigned: agent-2727
Agent identity: 46f6237a65ec4f1002c4d3fb201dc8633638d0947c276be7008c227e1051ba5e
Created: 2026-06-24T09:03:01.606384637+00:00
Started: 2026-06-24T22:54:24.525472792+00:00
Completed: 2026-06-25T09:06:10.241319889+00:00
Tags: fig5, sweepga, fastga, frequency32, query-grid, slurm, eval-scheduled

Description

Run query-grid chop/filter for the Fig5 f32 raw SweepGA/FastGA alignment iteration.

Dependency: use raw f32 many:many PAFs from fig5-sweepga-fastga-frequency32-raw and the merged query-grid pafchop-rs.

Required run shape:

  • Work only under paper_prep/_brainstorming/pedigree_whole_genome_sweepga_fastga_frequency32/.
  • Chop from raw f32 PAFs directly, not from prior chopped outputs.
  • Use pafchop --chunk-mode query-grid --overlap 0 for chop lengths 10000, 5000, 2000. Do not attempt 1000 unless explicitly added later; f16 1kb was cancelled for runtime.
  • Use distinct output dirs: chopped_paf_qgrid_l{N}_o0 and filtered_paf_chop_sensitivity_query_grid/l{N}.
  • Filter each query-grid chopped PAF with SweepGA: --num-mappings 1:1 --scaffold-jump 0 --scoring ani --overlap 0.
  • Use Slurm/job arrays and /dev/shm scratch; do not run heavy chopping/filtering on the head node. Use pigz for compression/decompression.
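
The run shape above implies one chopped and one filtered directory per chop length. A minimal sketch of that layout follows, assuming both directory templates sit directly under the working directory and that {N} is the chop length in bases; the helper name output_dirs is hypothetical and not part of any existing script.

```python
from pathlib import PurePosixPath

# Working directory and chop lengths are taken from the task description.
WORK_DIR = PurePosixPath(
    "paper_prep/_brainstorming/pedigree_whole_genome_sweepga_fastga_frequency32"
)
CHOP_LENGTHS = (10000, 5000, 2000)  # 1000 is deliberately left out of this run


def output_dirs(length: int) -> dict:
    """Return the chopped and filtered output directories for one chop length.

    Assumption: both templates are resolved relative to WORK_DIR.
    """
    if length not in CHOP_LENGTHS:
        raise ValueError(f"chop length {length} is not part of this run")
    return {
        "chopped": WORK_DIR / f"chopped_paf_qgrid_l{length}_o0",
        "filtered": WORK_DIR
        / "filtered_paf_chop_sensitivity_query_grid"
        / f"l{length}",
    }


if __name__ == "__main__":
    # One tab-separated row per chop length: length, chopped dir, filtered dir.
    for n in CHOP_LENGTHS:
        dirs = output_dirs(n)
        print(n, dirs["chopped"], dirs["filtered"], sep="\t")
```

Rejecting lengths outside the listed set keeps an accidental 1000 run from creating output directories.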

Validation:

  • Run pigz -t on all chopped and filtered PAFs.
  • Write sha256 sidecars.
  • Record job IDs, hosts, commands, binary paths, binary sha256, chunk mode, length, threads, scratch dir, and status in summary TSVs.
  • Write a shifted-boundary audit proving f32 chunks are on the absolute query grid.
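
Two of the validation steps above can be sketched in a few lines. The example below writes a sha256 sidecar in sha256sum format and counts chopped records that cross a grid boundary. It assumes that "on the absolute query grid" means no chunk's query interval spans a multiple of the chop length, and that the sidecar is named by appending .sha256 to the file name; both are assumptions, and the function names are hypothetical. Files written by pigz are ordinary gzip streams, so Python's gzip module can read them.

```python
import gzip
import hashlib
from pathlib import Path


def write_sha256_sidecar(path: Path) -> Path:
    """Write '<name>.sha256' next to path in sha256sum format and return it."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in 1 MiB blocks so large PAFs are never loaded whole.
        for block in iter(lambda: handle.read(1 << 20), b""):
            digest.update(block)
    sidecar = path.with_name(path.name + ".sha256")
    sidecar.write_text(f"{digest.hexdigest()}  {path.name}\n")
    return sidecar


def off_grid_records(paf_gz: Path, length: int) -> int:
    """Count PAF records whose query interval crosses a multiple of length.

    PAF column 3 is the 0-based query start and column 4 the exclusive
    query end. Empty or inverted intervals are also counted as failures.
    """
    bad = 0
    with gzip.open(paf_gz, "rt") as handle:
        for line in handle:
            fields = line.rstrip("\n").split("\t")
            q_start, q_end = int(fields[2]), int(fields[3])
            # A chunk is on the grid if its first and last base share a cell.
            if q_end <= q_start or q_start // length != (q_end - 1) // length:
                bad += 1
    return bad
```

A count of zero for every chopped PAF at its own chop length would be the evidence recorded in the audit; any nonzero count points to chunks cut relative to alignment starts instead of absolute query coordinates.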

Acceptance criteria:

  • All 9 required comparison x length filtered outputs exist and validate.
  • query_grid_chop_filter_manifest.tsv clearly distinguishes f32 from f16.
  • README or notes state the exact f32 settings and point to f16 for comparison.
  • Commit with message: feat: fig5-f32-query-grid-chop-filter-rerun (agent-NNN)
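
The first acceptance criterion can be checked mechanically. The sketch below lists expected filtered outputs that are absent or empty, assuming one file per comparison named <comparison>.paf.gz inside each l{N} directory. That file naming is hypothetical, and the comparison names must be supplied by the caller because this task description does not list them.

```python
from itertools import product
from pathlib import Path

CHOP_LENGTHS = (10000, 5000, 2000)


def missing_filtered_outputs(root: Path, comparisons: list) -> list:
    """List expected filtered PAFs that are absent or empty under root.

    Assumption: each output lives at root/l<N>/<comparison>.paf.gz.
    """
    missing = []
    for comparison, length in product(comparisons, CHOP_LENGTHS):
        path = root / f"l{length}" / f"{comparison}.paf.gz"
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(path)
    return missing
```

With three comparison names this covers the 9 required outputs; an empty return value means every expected file exists, after which the pigz -t and sha256 checks still need to pass.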

Depends on

Required by

Log