Skip to content

Release 0.4.0#75

Merged
lucapinello merged 2 commits intomainfrom
release/0.4.0
Apr 30, 2026
Merged

Release 0.4.0#75
lucapinello merged 2 commits intomainfrom
release/0.4.0

Conversation

@lucapinello
Copy link
Copy Markdown
Contributor

Headline changes since 0.3.0

ChromBPNet CDFs rebuilt against chrombpnet_nobias

The 0.2.x CDFs on lucapinello/chorus-backgrounds were built against the bias-aware chrombpnet variant. After the 0.3.0 default flipped to chrombpnet_nobias, percentile lookups were doing chrombpnet_nobias predictions → chrombpnet empirical CDFs — biased.

Rebuilt on A100 in ~10h. New NPZ:

  • 786 tracks (22 ATAC + 20 DNASE + 744 CHIP)
  • All CDFs monotone, every reservoir filled (effect_count=9609 per track)
  • sha256: be61e9e8f9b919b43c599b7fbc9deb74f8f1e6dc1da5e2cdb92036a85bf13205
  • ATAC/DNase p95 shifts 13.5–29.3% (bias correction)
  • CHIP/BPNet p95 ~unchanged (already nobias-equivalent)

Audit: audits/2026-04-29_chrombpnet_cdf_rebuild/report.md.

HuggingFace mirror consolidation

Enformer / Borzoi / Sei / LegNet weights now fetch from chorus-controlled HF repos with original-source fallback. Closes the TFHub deprecation hazard. Audit: audits/2026-04-29_hf_mirror_consolidation/report.md.

AlphaGenome PyTorch backend

Second AlphaGenome oracle (alphagenome_pt) shipped alongside the JAX default. Same model + same weights converted to safetensors. recommend_alphagenome_backend() helper picks the right backend per platform + window size. Both install by default. Audit trail: audits/2026-04-29_alphagenome_pytorch_spike/, audits/2026-04-29_alphagenome_pt_stress_test/.

Env-setup hardening

HF-mirror direct-load fix (post-merge, PR #69)

Wired chorus mirror routing into Enformer + Borzoi _load_direct paths. Caught during the scorched-earth audit at audits/2026-04-30_overnight_full_audit/report.md.

Validation summary (from overnight scorched-earth audit)

  • Fresh install from zero: ✅ all 7 oracle envs rebuild via chorus setup
  • 7/7 oracles chorus health Healthy
  • 7/7 oracles load + predict + return finite values
  • All 3 shipped notebooks execute end-to-end (49 + 127 + 59 cells, 0 errors)
  • analyze_variant_multilayer end-to-end on SORT1 rs12740374 with the new CDF: PASS
  • Fast suite: 368 passed

Test plan

  • Fast suite local: 368 passed
  • Fresh install end-to-end: all phases green (audits/2026-04-30_overnight_full_audit/report.md)
  • New CDF sha256 verified against locally-built file
  • Variant analysis with new CDF: HepG2 DNase at SORT1 rs12740374 returns finite report
  • Linux CI fast suite

Open follow-up issues (not blocking)

🤖 Generated with Claude Code

lp698 and others added 2 commits April 30, 2026 06:52
Headline changes since 0.3.0:

- Rebuilt ChromBPNet per-track CDFs against chrombpnet_nobias (the 0.3+
  default). Prior CDFs targeted the bias-aware chrombpnet; effect
  percentiles now look up against the matching empirical distribution.
  786 tracks, all CDFs monotone, sha256 be61e9e8... on
  lucapinello/chorus-backgrounds. ATAC/DNase p95 shifts 13.5-29.3%
  (bias correction). Audit at audits/2026-04-29_chrombpnet_cdf_rebuild/.
- HuggingFace mirror consolidation: Enformer, Borzoi, Sei, LegNet now
  fetch from chorus-controlled HF repos (lucapinello/chorus-{enformer,
  borzoi,sei,legnet}) with original-source fallback. Closes the TFHub
  deprecation hazard. Audit at audits/2026-04-29_hf_mirror_consolidation/.
- AlphaGenome PyTorch backend (alphagenome_pt) shipped alongside the JAX
  default — same model, same weights converted to safetensors. Both
  install by default in `chorus setup`. Routing helper
  recommend_alphagenome_backend() suggests which to use based on
  platform + window size. Audit trail at
  audits/2026-04-29_alphagenome_pytorch_spike/ and
  audits/2026-04-29_alphagenome_pt_stress_test/.
- Env-setup hardening: timeout-soft policy in _setup_environment so
  cold-NFS validation timeouts don't silently downgrade
  use_environment=True; _check_env_ready() now plugged into every
  oracle's load_pretrained_model so missing-dep failures raise
  EnvironmentNotReadyError up-front instead of falling through to
  ModuleNotFoundError. Closes #64. Equivalence test rewritten via
  chorus._predict closes #63 + #65.
- HF-mirror direct-load fix (post-merge): wired chorus mirror routing
  into Enformer + Borzoi _load_direct paths (PR #69). Caught during the
  scorched-earth audit at audits/2026-04-30_overnight_full_audit/.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@lucapinello lucapinello merged commit 30387d0 into main Apr 30, 2026
1 check passed
@lucapinello lucapinello deleted the release/0.4.0 branch April 30, 2026 11:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

tests/test_alphagenome_backends_equivalence.py: rewrite to compare via chorus._predict (raw-head shape assertion always fails)

1 participant