fig5-raw-manymany-impg-similarity-2kb-single-node-race

Race Fig5 raw many:many IMPG 2kb full-BED single-node jobs

Metadata

Statusdone
Assignedagent-2859
Agent identity46f6237a65ec4f1002c4d3fb201dc8633638d0947c276be7008c227e1051ba5e
Created2026-06-27T15:57:17.254093855+00:00
Started2026-06-27T15:59:28.399249248+00:00
Completed2026-06-27T16:09:23.983264718+00:00
Tagsfig5, eval-scheduled

Description

Submit a no-array single-node Slurm race for the Fig5 raw many:many 2 kb IMPG similarity scan. Do not run WFMASH, SweepGA/FastGA, minimap2, seqwish, odgi, or any new alignment. Use existing raw unfiltered many:many PAFs and the existing 2 kb full-genome BEDs/manifests from paper_prep/_brainstorming/fig5_raw_manymany_impg_similarity_2kb_sharded/.

Goal: test the user's preferred simpler execution shape: one Slurm job per method x comparison, no array, one node, explicit threads from the allocation. For each of the six method/comparison pairs, call /home/erikg/.cargo/bin/impg similarity once with the full 2 kb target BED and --threads ${SLURM_CPUS_PER_TASK}. Use --cpus-per-task based on available node class; prefer 96 on tux nodes if the partition supports it, otherwise 48 on octopus/workers. Do not hard-code 1 thread.

Outputs should go to a separate scratch directory so they do not overwrite the sharded run: paper_prep/_brainstorming/fig5_raw_manymany_impg_similarity_2kb_single_node_race/. Submit a Slurm dependency finalizer for these six jobs, not a WG polling loop. Preserve all-hit IMPG outputs, and build best-per-window plotting summaries with the same rule as the sharded finalizer: one best interchromosomal hit per 2 kb target window; tie-break by estimated.identity, intersection, dice, cosine, jaccard, then stable lexical target coordinates.

Validation:

  • Exactly six non-array Slurm jobs submitted, one per method x comparison, using ${SLURM_CPUS_PER_TASK}.
  • No new alignment is performed.
  • Finalizer is a Slurm dependency job.
  • Report job IDs, node/CPU settings, output paths, and whether this is expected to supersede the current array run.
  • Do not cancel current arrays 1706840-1706845 unless explicitly instructed by the user.

Depends on

Required by

Log