Add variance-stratified residual-Mahalanobis FDR flavor by katosh · Pull Request #8 · settylab/kompot

katosh · 2026-04-18T00:34:21Z

Summary

Opt-in FDR calibration mode FDRSettings(mode="variance_stratified") that residualises log(1 + D²) against a smooth surface in (log_mean, log_var) fit on the permutation null, then applies kompot's existing 1-D local FDR to the standardised residual Z. Intended for low-replication designs (≈2 biological replicates per condition), where the raw Mahalanobis is dominated by per-gene variance rather than condition-specific signal.

Default behaviour is unchanged (mode="raw").
Raw *_mahalanobis, *_mahalanobis_local_fdr, and *_is_de columns are preserved.
New columns added alongside:
*_residual_mahalanobis, *_residual_z, *_residual_local_fdr, *_residual_is_de
(plus _pvalue / _tail_fdr / _log_mean / _log_var under store_additional_stats=True).
Public helpers in kompot.residualization for post-hoc use on any existing kompot output.

Background

When GPSettings.use_empirical_variance=False (kompot's default — chosen so per-cell sampling noise is not conflated with epistemic posterior uncertainty), D²_g scales monotonically with per-gene expression variance σ²_g. The gene-shuffled null inherits that scaling, so the 1-D local FDR is genome-wide calibrated but not gene-specific. With ~2 replicates per condition, any 2-vs-2 partition — including the real one — produces mean differences roughly proportional to σ²_g, so high-variance genes rank at the top under any partition.

Method

Per gene: m_g = log1p(mean_g), v_g = log1p(var_g) from the DE expression layer.
Fit null-trend surface φ̂(m, v) = E[log(1 + D²_null) | m, v] via OLS on a 7-term tensor polynomial over the gene-shuffled null draws.
Homoscedastic null residual scale σ̂_null.
R_g = log(1 + D²_g) − φ̂(m_g, v_g), Z_g = R_g / σ̂_null.
Apply kompot's existing 1-D local FDR to max(0, Z_g). Below-null genes get local_fdr ≈ 1.

Tal1 chimera validation

Post-hoc on /export/chimera_tal1_kompot_ps01_*:

	raw top-20	residual top-20
Hematopoietic (Hbb-bh1, Itga2b, Lyl1, Stab1/2, Tfec, Ackr1, …)	5	11
Imprinted (Cdkn1c, Phlda2, Grb10, Impact, Mcts2, Nnat, …)	12	0
Trend R² on Tal1 null	—	0.88
`σ̂_null` on Tal1 null	—	0.48

Residual top-20 (in order): Hbb-bh1, Hba-x, Hbb-y, Stab2, Itga2b, Stab1, Tfec, Aqp8, Rplp0, Rpl41, Hba-a1, Slc22a18, Ackr1, Ushbp1, Erdr1, Rps18, Adora2a, Clec14a, Lyl1, Hba-a2. Matches the known Tal1 hematopoietic phenotype; the raw list was dominated by high-variance imprinted / X-linked genes.

Design doc

Full derivation, evidence table, and limitations section (also destined for the methods supplement) live in the design note reports/kompot_notebooks_private_2026-04-17_142500_residual-mahalanobis-flavor.md of the companion kompot_notebooks_private workspace.

What this does and does not do

Does

Remove the strictly gene-level monotone dependence on (mean, variance) from the statistic before FDR calibration.
Rank genes by how unusual their Mahalanobis is conditional on their (m, v).
Reuse kompot's existing GP fit / Mahalanobis / null-gene machinery; no GP changes.

Does not

Remove manifold / cell-type structure. Genes that vary strongly between cell types will still have large Mahalanobis under any cell partition that splits cell-type mixtures.
Fix the underlying experimental-design limitation of two biological replicates per condition. With sample fully confounded with condition, no statistic can separate condition-level biology from embryo-to-embryo variability in principle; the residual flavor correctly refuses to rank variability genes first but does not convert a 4-embryo design into a well-replicated one.

Settings

FDRSettings(
    mode="variance_stratified",               # opt-in; 'raw' is default
    null_trend_features=("log_mean", "log_var"),
    null_trend_model="poly3",                  # 7-term tensor polynomial (default)
    # or "poly3_mean_only" for a diagnostic mean-only model
)

Test plan

19 new tests in tests/test_residualization.py (feature extraction, null-trend fit, residualisation math, end-to-end through kompot.de, settings validation) — all pass.
Full FDR test suite (test_fdr_utils, test_fdr_edge_cases, test_fdr_integration, test_model_settings): 75 / 75 pass.
Full DE test suite (test_differential, test_differential_expression_core, test_differential_expression_comprehensive, test_de_comprehensive_params, test_field_tracking, test_anndata_tracking): 74 pass, 4 pre-existing skips.
Tal1 top-20 flip validated post-hoc (see table above).
Upstream CI on PR branch.

Files

kompot/residualization.py — pure-numpy module with compute_gene_features, fit_null_trend, residualize_mahalanobis, residual_local_fdr
kompot/settings.py — FDRSettings knobs: mode, null_trend_features, null_trend_model
kompot/anndata/_de_helpers.py — _compute_residual_fdr helper; extended _compute_fdr, _store_de_results, _build_field_mapping, _check_overwrites
kompot/anndata/differential_expression.py — plumbs settings through de() and legacy compute_differential_expression()
kompot/anndata/utils/field_tracking.py — residual field-name keys
docs/source/variance_stratified_fdr.rst — user-facing guide
docs/source/index.rst, README.md, CHANGELOG.md
tests/test_residualization.py

- Add kompot smooth section with usage, options, and examples - Add smooth to overview, quick start workflow, config templates, help - Fix all \\ to single \ in code-block directives (RST renders literally)

The per-command basic examples repeated the same info as the workflow block above and the detailed command sections below.

…abels - Exclude compute_differential_abundance/expression from automodule - Add smooth_expression automodule (exclude deprecated compute_smoothed_expression) - Add RunInfo.to_settings() and call_args() to documented members - Rename "Gene Expression Imputation" to "Smoothing" in toctree

- Add --dry-run flag to kompot de: estimates resource requirements without running the analysis - JSON output to stdout (or -o file), human-readable report to stderr - Exit code 0 if feasible, 1 if not - Add ResourcePlan.to_dict() for machine-parseable serialization - Add configure_logging() to redirect kompot logger stream - CLI now logs to stderr by default (keeps stdout clean for machine-parseable output like --dry-run and --table-output) - Document --dry-run in CLI docs with pipeline examples

Users would likely reuse the same args as the real run, which would overwrite their h5ad output with a JSON file. Stdout-only is safer.

- Add dry_run=False to all 8 DE test Namespace objects (run_de now accesses args.dry_run from the --dry-run flag added in this PR) - Update actions/checkout v4→v6 and actions/setup-python v5→v6 to resolve Node.js 20 deprecation warnings

- Add TestCLIDryRun: covers --dry-run JSON output, infeasible exit code 1, and output validation skip when dry_run=True - Add TestConfigureLogging: covers stream redirection and default - Fix codecov action: file→files (v5 API change) - Add --cov-report=term-missing to pytest so coverage stats appear in CI logs

When GPSettings.use_empirical_variance=False (the default) the Mahalanobis statistic D^2 scales monotonically with per-gene expression variance; the gene-shuffled null inherits that scaling, so the 1-D local FDR is genome-wide calibrated but not gene-specific. In designs with ~2 biological replicates per condition this causes the top DE list to be dominated by high-variance/variability genes rather than condition-specific signal. Add an opt-in FDRSettings.mode='variance_stratified' that: * fits a smooth null-trend surface phi_hat(log_mean, log_var) on log1p(null_mahalanobis) via OLS on a 7-term tensor polynomial * residualises log1p(D^2) and standardises by the homoscedastic null residual scale * applies kompot's existing 1-D local FDR to max(0, Z) Outputs are written to adata.var alongside the untouched raw columns: *_residual_mahalanobis, *_residual_z, *_residual_local_fdr, *_residual_is_de (and pvalue/tail_fdr/log_mean/log_var under store_additional_stats=True). Public helpers live in kompot.residualization for post-hoc use. Tal1 chimera validation: top-20 flips from 12 imprinted / 5 hematopoietic (raw) to 0 imprinted / 11 hematopoietic (residual), including the predicted Itga2b, Lyl1, Stab1/2, Tfec, Ackr1. Default behaviour is unchanged; mode='raw' remains the default.

katosh and others added 12 commits April 13, 2026 17:51

Add smooth command to CLI docs, fix double-backslash rendering

54e614b

- Add kompot smooth section with usage, options, and examples - Add smooth to overview, quick start workflow, config templates, help - Fix all \\ to single \ in code-block directives (RST renders literally)

Remove redundant Quick Start examples from CLI docs

aafc175

The per-command basic examples repeated the same info as the workflow block above and the detailed command sections below.

Remove -o file option from --dry-run to prevent accidental overwrites

d7d8da0

Users would likely reuse the same args as the real run, which would overwrite their h5ad output with a JSON file. Stdout-only is safer.

Add Unreleased changelog section for post-v0.7.0 changes

7f63749

Resolve merge conflict in cli.rst: keep simplified quick start

c691403

Fix missing sys import in configure_logging tests

5adc699

Test: update _settings_to_dict snapshot for new FDR fields

ef9b24e

katosh closed this Apr 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add variance-stratified residual-Mahalanobis FDR flavor#8

Add variance-stratified residual-Mahalanobis FDR flavor#8
katosh wants to merge 12 commits into
mainfrom
residual-mahalanobis-flavor

katosh commented Apr 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

katosh commented Apr 18, 2026

Summary

Background

Method

Tal1 chimera validation

Design doc

What this does and does not do

Settings

Test plan

Files

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant