fix: njit fallback for broken runtime with torch #164
flying-sheep merged 17 commits into scverse:main from
Conversation
Codecov Report

✅ All modified and coverable lines are covered by tests.

@@            Coverage Diff             @@
##             main     #164      +/-   ##
==========================================
+ Coverage   99.22%   99.29%   +0.06%
==========================================
  Files          20       21       +1
  Lines         519      566      +47
==========================================
+ Hits          515      562      +47
  Misses          4        4
flying-sheep left a comment
Hi, thanks for this! I like the idea; with a subprocess, even UB is unlikely to cause problems. Is the real fix tracked somewhere, in the torch issue tracker or so?
I’m less of a fan of the gratuitous monkeypatching in the tests, but I get that avoiding it would mean re-architecting everything, e.g. by using a (data)class that can be initialized with all the variables that it by default gets from global constants and numba settings. No need to do that, I’ll do it when you’re done with the rest!
Thanks!!!
thank you!
This PR adds a narrow workaround in fast-array-utils’ custom njit wrapper for the Apple Silicon crash reported in scanpy when torch is loaded and a numba parallel path is used. The goal is the same “workaround” path suggested by flying-sheep in the Scanpy discussion: handle this in the shared runtime-dispatch layer instead of removing parallelism in Scanpy itself.

The approach keeps the normal parallel path by default. But on macOS arm64, when torch is already loaded and the current threading config is not explicitly pinned to a safe layer like workqueue or tbb, the wrapper runs a small cached subprocess probe before using the parallel implementation. The probe mirrors the current Numba threading config, reproduces the relevant import context, and checks whether a tiny @numba.njit(parallel=True) function actually runs successfully. If it succeeds, the parallel version is used as usual. If it fails, the wrapper falls back to the already-compiled serial version and emits a warning.

I also narrowed the probe context based on reproduction work in the failing environment. In that setup, torch was the relevant one, so the probe now mirrors only the loaded torch state, and the cache key includes that state as well. This keeps the workaround smaller and avoids reusing a cached “safe” result after torch is imported later.

Tests cover the probe gating logic, lazy detection behavior, env/config mirroring for the subprocess, cache-key behavior, subprocess success and failure cases, wrapper dispatch between serial and parallel implementations, and correctness of the serial fallback path.