figure-ed3-ed4-annotation-genes

ED3+ED4: annotation (TAR1/ITS/telo) + gene enrichment

Metadata

Statusdone
Assignedagent-697
Agent identity3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Created2026-05-05T05:00:06.303863688+00:00
Started2026-05-05T05:28:15.929325549+00:00
Completed2026-05-05T05:42:15.183716253+00:00
Tagspaper-prep,figure, eval-scheduled
Eval score0.88
└ blocking impact0.90
└ completeness0.90
└ constraint fidelity0.85
└ coordination overhead0.85
└ correctness0.85
└ downstream usability0.90
└ efficiency0.85
└ intent fidelity0.94
└ style adherence0.90

Description

Description

Produce ED3 (4 panels) annotation and ED4 (4 panels) gene enrichment. Implement directly — do not decompose further.

File scope

  • paper_prep/figures/ed3/figure_ed3.{pdf,png,R/py}, caption.md, sources.tsv
  • paper_prep/figures/ed4/figure_ed4.{pdf,png,R/py}, caption.md, sources.tsv

Figure spec (excerpt from MANUSCRIPT_SKELETON.md ED3, ED4)

ED3 — Annotation: TAR1 + internal (TTAGGG)n + telomere length

PanelContentStatusSource
ED3aTAR1 prevalence per arm (PAR1 absence; acrocentric intermediate; autosomal saturation)GENERATEcommunity_tar1_by_arm.tsvSURVEY_02 §6 Fig M1a
ED3bInternal (TTAGGG)n island length distribution + canonical-fraction histogramGENERATElength_distribution.tsv + motif_composition.tsvSURVEY_02 §6 Fig M2
ED3cTerminal telomere length by community (Kruskal-Wallis H = 100.89, p = 3.2e-15)GENERATE.telo.tsv joined to community assignments — SURVEY_02 §6 ED3
ED3dPer-arm TAR1 positional distance-from-telomereGENERATEtar1_positional_per_arm.tsvSURVEY_02 §6 Fig M1b

ED4 — Gene enrichment, pseudogene gradient, copy-weighted GO

PanelContentStatusSource
ED4aGSEA / GO:BP top terms (snRNP, olfactory, sensory) — vertical barREADYFigure1_GSEA_BP_vertical.pdf (1 Mb caveat — flag PHR-only re-run) — SURVEY_FIG_inv §3
ED4bCopy-weighted vs deduplicated comparison (olfactory fold = 598)GENERATEimproved_copy_weighted_vs_deduplicated_comparison.csvSURVEY_DATA §4
ED4cHigh-copy gene families (DUX4 ×18, BAGE2, MTCO, RPL23A, SEPTIN14P22, OR4F)GENERATEgene_copy_summary.csvSURVEY_DATA §2
ED4dOR4F pseudogenisation gradient (62.1 % pseudogene; 11.1 % chr7_p → 99.8 % chr15_q)GENERATEper-arm pseudogene fraction — SURVEY_10/11/12 C12

Validation

  • All 8 panels; sources.tsv per ED
  • Captions ≤ 200 words; ≥ 2 metrics with TSV paths
  • PDF + PNG per ED
  • ED4a caption notes 1 Mb window caveat (PHR-only re-run flagged in WORK_DECOMPOSITION.md ## Gaps)

Inputs

  • paper_prep/synthesis/MANUSCRIPT_SKELETON.md
  • paper_prep/surveys/SURVEY_02_annotation.md, SURVEY_DATA_inventory.md, SURVEY_10_11_12_limits_summary_lit.md

Depends on

Required by

Log