step-1-extract-chm13

Step 1: Extract CHM13 PHR BED

Metadata

Statusdone
Assignedagent-13
Agent identity3184716484e6f0ea08bb13539daf07686ee79d440505f1fdf2de0357707034c3
Created2026-03-31T21:03:24.721336362+00:00
Started2026-03-31T21:07:09.180844815+00:00
Completed2026-03-31T21:08:24.263183727+00:00
Tagsimpl, eval-scheduled
Eval score0.84
└ blocking impact0.95
└ completeness1.00
└ coordination overhead0.95
└ correctness0.98
└ downstream usability0.93
└ efficiency0.95
└ intent fidelity0.66
└ style adherence0.92

Description

Goal

Create a clean BED file of CHM13 PHR intervals by extracting CHM13 rows from CHM13-HG002.sub-telo-phrs.bed and stripping PanSN prefixes.

Approach

grep '^CHM13#0#' CHM13-HG002.sub-telo-phrs.bed \
  | sed 's/CHM13#0#//' \
  > chm13.phrs.bed

Context

  • Check the research task output first for file locations and any caveats
  • Should produce ~37 intervals covering 41 chromosome arms
  • The BED columns are: chr, start, end, comma-separated list of chromosomes sharing sequence

Validation

  • chm13.phrs.bed exists and has ~37 lines
  • Chromosome names are standard (chr1, chr2, ...) without PanSN prefix
  • File is valid BED format (tab-separated, start < end)
  • Log the line count and first few lines

Depends on

Required by

Log