figure-ed1-ed2-pipeline-seqlevel

ED1+ED2: pipeline + sequence-level (50-community) detail

Metadata

Status: done
Assigned: agent-696
Agent identity: 3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Created: 2026-05-05T04:58:03.741267681+00:00
Started: 2026-05-05T05:28:12.319585839+00:00
Completed: 2026-05-05T05:51:22.419001555+00:00
Tags: paper-prep, figure, eval-scheduled
Eval score: 0.90
└ blocking impact: 0.88
└ completeness: 0.95
└ constraint fidelity: 0.85
└ coordination overhead: 0.90
└ correctness: 0.92
└ downstream usability: 0.92
└ efficiency: 0.86
└ intent fidelity: 0.95
└ style adherence: 0.88

Description

Produce Extended Data ED1 (4 panels) and ED2 (4 panels). Implement directly — do not decompose further.

File scope

  • paper_prep/figures/ed1/figure_ed1.{pdf,png,R/py}, caption.md, sources.tsv
  • paper_prep/figures/ed2/figure_ed2.{pdf,png,R/py}, caption.md, sources.tsv

Figure spec (excerpt from MANUSCRIPT_SKELETON.md ED1, ED2)

ED1 — Pipeline and per-arm flank inventory

| Panel | Content | Status | Source |
|---|---|---|---|
| ED1a | Pipeline schematic: 465 assemblies → 18,827 flanks → 15,668 PHRs → 15/50 communities | GENERATE | new schematic |
| ED1b | Per-arm flank counts (48 arms) with assembly QC overlay | GENERATE | contig_classifications.tsv; SURVEY_01 §3 |
| ED1c | PHR length distribution (median 105 kb, mean 144 kb) | GENERATE | all-vs-all.1Mb.p95.id95.len.tsv; SURVEY_01 §3 |
| ED1d | Chr18_q (NA18982#1) chimera evidence: wfmash + minimap2 dotplot + NNN gap + Flagger | GENERATE | SURVEY_01 §1.5, §5 item 6 |
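
The median and mean annotated on ED1c could be recomputed from the length TSV along the following lines. This is only a minimal sketch: the helper name `length_summary`, the assumption that lengths are integer base pairs, and the column index are all hypothetical, since the layout of all-vs-all.1Mb.p95.id95.len.tsv is not documented in this task.

```python
import csv
import io
import statistics


def length_summary(handle, length_col=0, has_header=False):
    """Return (median, mean) in kb for one length column of a TSV.

    length_col and has_header are assumptions: the real column layout
    of the length TSV is not documented in this task.
    """
    reader = csv.reader(handle, delimiter="\t")
    if has_header:
        next(reader, None)  # skip the header row
    lengths_kb = [int(row[length_col]) / 1000 for row in reader if row]
    return statistics.median(lengths_kb), statistics.fmean(lengths_kb)


# Toy input (not real data): three lengths in bp, one per line.
demo = io.StringIO("50000\n100000\n300000\n")
median_kb, mean_kb = length_summary(demo)
print(f"median {median_kb:.0f} kb, mean {mean_kb:.0f} kb")
```

Reporting both statistics is useful here because a mean well above the median, as in the panel spec, indicates a right-skewed length distribution.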

ED2 — Sequence-level (50-community) detail

| Panel | Content | Status | Source |
|---|---|---|---|
| ED2a | UMAP / force-directed layout coloured by 50-community partition | READY/composite | plot-seq-community-structure.R outputs (/moosefs/.../similarity/) |
| ED2b | Within-community Jaccard distance bimodality (C1, C2, C3, C5, C6, C7, C11, C12) | GENERATE | similarity.tsv.gz per community subsets; SURVEY_04 §1.10, §6 F10 |
| ED2c | Cross-arm affinity circular plot: 41 arms with edges weighted by absorbed sequences | GENERATE | cross_arm_affinity_sequences.tsv; SURVEY_01 §6 F5 |
| ED2d | Confusion matrix Arm-Leiden vs Sequence-Leiden (15 × 50; ARI 0.35, NMI 0.76) | GENERATE | arm-leiden vs seq-leiden assignment TSVs |
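
For ED2d, the confusion matrix and the ARI can both be derived from a single contingency count over paired labels. The sketch below uses only the standard library and toy labels. The function names are hypothetical, and it assumes the two assignment TSVs have already been joined into two equal-length label lists, one entry per sequence. NMI is not computed here.

```python
from collections import Counter
from math import comb


def contingency(arm_labels, seq_labels):
    """Count sequences for each (arm community, sequence community) pair."""
    return Counter(zip(arm_labels, seq_labels))


def adjusted_rand_index(arm_labels, seq_labels):
    """Adjusted Rand index computed from the contingency table."""
    if len(arm_labels) != len(seq_labels):
        raise ValueError("label lists must have equal length")
    n = len(arm_labels)
    if n < 2:
        return 1.0  # no pairs to compare
    table = contingency(arm_labels, seq_labels)
    sum_cells = sum(comb(c, 2) for c in table.values())
    sum_rows = sum(comb(c, 2) for c in Counter(arm_labels).values())
    sum_cols = sum(comb(c, 2) for c in Counter(seq_labels).values())
    expected = sum_rows * sum_cols / comb(n, 2)
    max_index = (sum_rows + sum_cols) / 2
    if max_index == expected:
        return 1.0  # degenerate case: both partitions are trivial
    return (sum_cells - expected) / (max_index - expected)


# Toy labels (not real data): 2 arm-level vs 3 sequence-level communities.
arm = ["A", "A", "A", "B", "B", "B"]
seq = ["s1", "s1", "s2", "s3", "s3", "s3"]
table = contingency(arm, seq)
ari = adjusted_rand_index(arm, seq)
print(dict(table), round(ari, 3))
```

The same `table` counts, arranged with arm communities as rows and sequence communities as columns, give the 15 × 50 matrix to plot.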

Validation

  • All 8 panels present (4 in ED1, 4 in ED2), with one sources.tsv per ED figure
  • Each caption ≤ 200 words, citing ≥ 2 metrics with their TSV paths
  • PDF + PNG per ED
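
The file-level checks above could be automated with a short script such as the sketch below. It assumes the directory layout given under File scope and counts caption words by whitespace splitting, which is an assumption about how the 200-word limit is measured. The helper name `check_figure_dir` is hypothetical. It does not verify panel content or the "≥ 2 metrics" requirement, both of which still need manual review.

```python
from pathlib import Path

MAX_CAPTION_WORDS = 200


def check_figure_dir(fig_dir, stem):
    """Return a list of problems found in one Extended Data figure directory."""
    fig_dir = Path(fig_dir)
    problems = []
    for name in (f"{stem}.pdf", f"{stem}.png", "caption.md", "sources.tsv"):
        if not (fig_dir / name).is_file():
            problems.append(f"missing {name}")
    # The plotting script may be R or Python per the file scope.
    if not any((fig_dir / f"{stem}.{ext}").is_file() for ext in ("R", "py")):
        problems.append(f"missing {stem}.R or {stem}.py")
    caption = fig_dir / "caption.md"
    if caption.is_file():
        # Whitespace-delimited word count (assumed definition of "words").
        n_words = len(caption.read_text(encoding="utf-8").split())
        if n_words > MAX_CAPTION_WORDS:
            problems.append(
                f"caption has {n_words} words (limit {MAX_CAPTION_WORDS})"
            )
    return problems


if __name__ == "__main__":
    for ed in ("ed1", "ed2"):
        issues = check_figure_dir(Path("paper_prep/figures") / ed, f"figure_{ed}")
        print(ed, "OK" if not issues else issues)
```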

Inputs

  • paper_prep/synthesis/MANUSCRIPT_SKELETON.md
  • paper_prep/surveys/SURVEY_01_pipeline.md, SURVEY_04_heterogeneity.md

Depends on

Required by

Log