test: coverage initiative — kernels, readers, with_* API by d-laub · Pull Request #195 · mcvickerlab/GenVarLoader

d-laub · 2026-05-25T19:23:26Z

Summary

Three-wave test-coverage initiative.

Wave 1 — numba kernel behavior tests (coverage.py can't instrument numba):

Extended reconstruct_haplotype_from_sparse case matrix (ref-only, deletion spanning region end, overlapping variants)
New get_diffs_sparse tests covering both fast and slow paths
New filter_af tests across min/max/both/no-op + 2-D offsets layout (xfail — see Findings)
Filled choose_exonic_variants scenario gaps (spans start/end, entirely before/after)
New intervals_to_tracks tests

Wave 2 — user-facing readers + DataLoader:

_fasta.py: missing contig, protocol attrs, zero-length range
_bigwig.py: read smoke, protocol attrs, missing path raises RuntimeError
_torch.py: 6 smoke tests for get_dataloader / get_sampler / to_dataloader (skip cleanly via importorskip when torch missing; verified passing in py310/py311/py312/py313 envs)

Wave 3 — API surface:

Dataset.with_* matrix: with_settings, with_len, with_seqs, with_tracks, with_insertion_fill
Dataset.open error paths: missing dir, missing metadata, missing input_regions, ploidy mismatch, empty dataset
_indexing.py edge cases: negative indices, OOB, step slices, boolean masks, empty selections
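The index forms listed for _indexing.py can be illustrated with a small standalone sketch. `normalize_index` is a hypothetical helper written for this illustration, not GenVarLoader's implementation. It only shows the kind of behavior such edge-case tests usually pin down, assuming NumPy-like semantics for negative indices, out-of-bounds access, step slices, boolean masks, and empty selections.

```python
from typing import List, Sequence, Union

Index = Union[int, slice, Sequence[int], Sequence[bool]]


def normalize_index(idx: Index, length: int) -> List[int]:
    """Resolve an index into a list of non-negative positions.

    Hypothetical helper for illustration; not GenVarLoader's _indexing code.
    """
    # bool is a subclass of int, so reject a bare bool before the int branch.
    if isinstance(idx, bool):
        raise TypeError("a bare bool is not a valid index")
    if isinstance(idx, int):
        # Negative indices count from the end; anything else out of range fails.
        if not -length <= idx < length:
            raise IndexError(f"index {idx} out of bounds for length {length}")
        return [idx % length]
    if isinstance(idx, slice):
        # slice.indices clamps start/stop and handles negative steps.
        return list(range(*idx.indices(length)))
    items = list(idx)
    if items and all(isinstance(i, bool) for i in items):
        # A boolean mask must cover the whole axis.
        if len(items) != length:
            raise IndexError("boolean mask length must match the axis length")
        return [pos for pos, keep in enumerate(items) if keep]
    # Integer sequence (possibly empty): resolve each element in order.
    out: List[int] = []
    for i in items:
        out.extend(normalize_index(i, length))
    return out
```

For example, `normalize_index(-1, 5)` resolves to the last position, and an empty list resolves to an empty selection rather than raising.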

Coverage config (pyproject.toml):

Omitted _dataset/_intervals.py (numba-only)
Added @nb.njit / @numba.njit / PyTorch ImportError to exclude_lines
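As a rough illustration of how such exclusions behave: coverage.py treats each exclude_lines entry as a regular expression searched against source lines, and a match on a decorator line excludes the decorated function. The patterns below are assumptions made for this sketch; the PR's exact pyproject.toml entries are not reproduced here.

```python
import re

# Assumed patterns for illustration; the real config entries may differ.
EXCLUDE_PATTERNS = [
    r"@nb\.njit",
    r"@numba\.njit",
    r"except ImportError",
]


def is_excluded(line: str) -> bool:
    """Return True if any exclusion regex matches somewhere in the line."""
    return any(re.search(pattern, line) for pattern in EXCLUDE_PATTERNS)
```

Because the match is a search rather than a full-line match, decorator calls with arguments such as `@nb.njit(nogil=True)` are caught by the same pattern.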

Coverage impact

Overall: 63% → 74%. Key modules:

_genotypes.py: 5% → 100% (numba exclusion working)
_intervals.py: 12% → omitted
_bigwig.py: 47% → 61%
_open.py: 80% → 87%
_tracks.py: 41% → 79%
_torch.py: stays 31% in dev env (torch not installed); fully exercised in py3xx envs

Full report: docs/superpowers/specs/2026-05-25-test-coverage-after.txt.

Findings worth surfacing

filter_af 2-D offsets layout is broken — kernel calls lengths_to_offsets (plain numpy) from within @nb.njit, which raises a TypingError. Test is xfail(strict=True); fix would be @nb.njit-decorating lengths_to_offsets in _utils.py.
_torch.py ABI quirk — must import numpy before pytest.importorskip("torch") in py310 env to avoid a Fatal Python error during torch import. Documented in the test file with a noqa.

Test plan

pixi run -e dev pytest tests — 413 passed, 4 skipped, 3 xfailed
pixi run -e dev cargo test --release — 4 passed
pixi run -e py310 pytest tests/unit/test_torch.py — 6 passed
pixi run -e dev ruff check python/ tests/ — clean
pixi run -e dev typecheck — 0 errors

Spec: docs/superpowers/specs/2026-05-25-test-coverage-design.md
Plan: docs/superpowers/plans/2026-05-25-test-coverage-implementation.md

🤖 Generated with Claude Code

Omit _intervals.py (numba-only) and add exclude_lines for @nb.njit, @numba.njit, and the PyTorch ImportError guard. Saves the new coverage baseline (65% overall, _genotypes 5%→100%).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…rch imports

Two findings from the test-coverage initiative folded in:

1. filter_af's 2-D geno_offsets path called lengths_to_offsets (plain numpy) from inside @nb.njit, which numba could not compile. Replaced with an inline cumulative-sum loop. Unxfails test_filter_af_2d_offsets_layout.
2. test_torch.py imported torch via pytest.importorskip before genvarloader, which caused a numpy/torch ABI abort in py310 env. Reordered to import genvarloader first (which loads numpy in the order torch expects), then importorskip torch.
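The inline cumulative-sum replacement can be sketched in plain Python. This is a minimal illustration under two assumptions: that offsets start at 0 and have one more entry than lengths, and that the real kernel runs the same kind of loop over NumPy arrays inside @nb.njit. The function name is hypothetical.

```python
from typing import List, Sequence


def lengths_to_offsets_loop(lengths: Sequence[int]) -> List[int]:
    """Convert per-row lengths into offsets with an explicit running sum.

    Hypothetical stand-in; assumes offsets[0] == 0 and len(offsets) == len(lengths) + 1.
    """
    offsets = [0] * (len(lengths) + 1)
    for i, n in enumerate(lengths):
        # Each offset is the previous offset plus the previous row's length.
        offsets[i + 1] = offsets[i] + n
    return offsets
```

A loop of this shape uses only indexing and integer addition, which is the kind of code numba can compile without calling back into a non-jitted helper.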

…oader reproducible

Adds tests/integration/dataset/test_determinism.py covering:

- two opens with the same rng seed + jitter produce identical reads
- jitter=0 reads are byte-identical across repeated lookups
- (xfail) seeded torch.Generator should give reproducible shuffled batches. Currently broken: get_sampler() builds RandomSampler without forwarding the DataLoader's generator, so the sampler uses the global torch RNG.

Note: also surfaced that with_settings(rng=...) is broken in _impl.py (line 271 writes "rng" into to_evolve but the dataclass field is _rng), so the seed-equality test seeds via Dataset.open(..., rng=...) instead.
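Both reproducibility issues described above can be illustrated with standard-library stand-ins. This sketch assumes nothing about GenVarLoader's internals: `View`, `with_settings_buggy`, `with_settings_fixed`, and `shuffled_indices` are hypothetical names, `dataclasses.replace` stands in for whatever evolve mechanism the library uses, and `random.Random` stands in for `torch.Generator`.

```python
import random
from dataclasses import dataclass, field, replace
from typing import List, Optional


@dataclass(frozen=True)
class View:
    """Hypothetical dataset view whose RNG lives in a private field."""

    jitter: int = 0
    _rng: random.Random = field(default_factory=random.Random)


def with_settings_buggy(view: View, rng: random.Random) -> View:
    # Passes the public name "rng", but the field is "_rng": raises TypeError.
    return replace(view, rng=rng)


def with_settings_fixed(view: View, rng: random.Random) -> View:
    # Maps the public "rng" argument onto the private "_rng" field.
    return replace(view, _rng=rng)


def shuffled_indices(n: int, rng: Optional[random.Random] = None) -> List[int]:
    """Shuffle range(n); without a forwarded generator the global RNG is used."""
    order = list(range(n))
    source = rng if rng is not None else random
    source.shuffle(order)
    return order
```

Calling `shuffled_indices` twice with identically seeded generators yields the same order, whereas omitting the generator ties the result to global RNG state. That is the analogue of a sampler that ignores the DataLoader's generator.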

d-laub and others added 30 commits May 25, 2026 11:34

docs(spec): test coverage initiative

47602bf

Three-wave plan in a single bundled PR: numba-kernel behavior tests, user-facing readers + DataLoader, and Dataset.with_* / open/indexing API surface. Excludes numba-only modules from the coverage gate since coverage.py can't instrument them.

docs(plan): test coverage implementation plan

c1f2e62

test(kernels): extend reconstruct_haplotype case matrix

7a013a5

test(kernels): add get_diffs_sparse behavior tests

3bc71ee

test(kernels): add filter_af coverage across input layouts

ad3d460

test(kernels): fill choose_exonic_variants scenario gaps

ac4f523

test(kernels): add intervals_to_tracks behavior tests

3c59d07

test(fasta): cover missing contig, protocol attrs, zero-length

48dbb66

test(bigwig): cover read smoke path, protocol attrs, missing file

d411544

test(bigwig): tighten missing-path raises to RuntimeError

36deb66

test(torch): cover get_dataloader/get_sampler smoke paths

5093dc2

test(torch): drop unused numpy import

486cf92

test(torch): re-add numpy import before torch (ABI workaround)

c0f7181

test(dataset): cover with_* method matrix

7deccf2

19 tests across with_settings, with_len, with_seqs, with_tracks, and with_insertion_fill confirming each returns a new lazy view without mutating the original, and rejects invalid input.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
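The contract these tests check (a new view is returned, the original is untouched, invalid input is rejected) might look like the following minimal sketch. `LazyView` and its `with_len` are hypothetical stand-ins built on a frozen dataclass; they are not GenVarLoader's Dataset, whose accepted arguments may differ.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class LazyView:
    """Hypothetical immutable view; not GenVarLoader's Dataset."""

    path: str
    output_length: Optional[int] = None

    def with_len(self, output_length: int) -> "LazyView":
        # Validate first so invalid input never produces a view.
        if output_length < 1:
            raise ValueError("output_length must be a positive integer")
        # replace() builds a new instance and leaves self unchanged.
        return replace(self, output_length=output_length)


def test_with_len_returns_new_view() -> None:
    original = LazyView("data.gvl")
    updated = original.with_len(128)
    assert updated is not original
    assert updated.output_length == 128
    assert original.output_length is None


def test_with_len_rejects_invalid() -> None:
    try:
        LazyView("data.gvl").with_len(0)
    except ValueError:
        return
    raise AssertionError("expected ValueError")
```

The same three assertions (identity differs, new value set, old value preserved) generalize to each with_* method in the matrix.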

test(open): cover error paths on Dataset.open

c963f0a

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

test(indexing): cover negative, oob, boolean-mask, empty selections

bd3de7f

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(cov): regenerate baseline with full test data accessible

18dca3f

test(torch): noqa E402 for imports after importorskip

4b82588

style: ruff format pass

6e68d74

docs(spec): test coverage deeper initiative

c74c2d8

docs(plan): test coverage deeper implementation plan

93bb589

test(parity): oracle parity for annotated and reference output modes

8becb4d

test(parity): cross-mode equivalence (VCF/PGEN/SVAR, Ragged/Array, name/int)

f8a552f

test(kernels): cross-check fast/slow paths and 1d/2d layouts agree

ea302bb

test(determinism): xfail capture for with_settings(rng=...) bug

dd9547a

test(query): cover filter combinations and empty-result behavior

16675b6

test(ragged): cover public utilities in _ragged

ac22379

d-laub and others added 21 commits May 25, 2026 18:02

test(ragged): xfail capture for to_padded numeric branch bug

4bb252a

test(sitesonly): fill INFO-field and empty-region gaps

6de011c

test(dataset-utils): cover public helpers in _dataset/_utils

fcd4366

test(torch): cover return_indices and transform paths

92a7659

test(filter_af): pin down NaN handling contract

0b608c9

test(shapes): single-region, single-sample, empty subset

857c9e0

test(write): round-trip on empty BED, overlapping regions, missing contig

61425b5

style(tests): remove unused polars import in test_dataset_utils

767c429

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

docs(cov): new baseline after deeper coverage initiative

a2f1c17

style(tests): ruff format

aa1568c

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

fix(dataset): with_settings(rng=...) maps to _rng dataclass field

7bb614d

fix(torch): forward generator from DataLoader through to RandomSampler for reproducible shuffle

f15f54a

fix(ragged): to_padded numeric branch handles clip=True RegularArray output

d34ebee

fix(sitesonly): use correct genoray VCF.get_record_info kwargs (fields= not attrs=)

a683abc

test(boundary): contig-end region pads with N on read

ea4fc61

test(boundary): spanning deletion at region/contig boundary

30caa79

docs(cov): rebaseline after bug fixes and boundary tests

30eb1e3

style: ruff format pass

932f8c3

test(with-methods): self-build tracks-enabled dataset; don't rely on data-gen tracks

499662d

style: ruff format pass

d227a1d

test(with-methods): use bigwig-backed tracks fixture to avoid polars_bio state issue

c4d782e

d-laub wants to merge 51 commits into main from worktree-test-coverage-initiative
