Raise ValueError instead of tearing down CUDA when AuraFlow latents exceed pos_embed_max_size by Ricardo-M-L · Pull Request #13740 · huggingface/diffusers

Ricardo-M-L · 2026-05-13T08:49:12Z

Why

When the input latent grid exceeds the pretrained positional embedding grid, pe_selection_index_based_on_dim silently produces negative / out-of-range gather indices. On CUDA this trips a vectorized_gather_kernel device-side assert, which destroys the CUDA context for the entire process and forces a Python restart.

Fix

Check the bounds up front and raise a `ValueError` with a clear message about the largest supported resolution, matching how `PatchEmbed.cropped_pos_embed` in `models/embeddings.py` handles the same situation for SD3.

```
AuraFlow positional embedding only supports up to N latent tokens
per axis, but got M. Reduce height/width below ...
```

Verification

13 LOC, 1 file. Pure error-path improvement — the happy path is unchanged. Without the fix the failure mode is a permanent CUDA tear-down requiring process restart; with it the user gets a clean exception they can catch.

…xceed pos_embed_max_size When the input latent grid exceeds the pretrained positional embedding grid, pe_selection_index_based_on_dim silently produces negative / out-of-range gather indices. On CUDA this trips a vectorized_gather_kernel device-side assert, which destroys the CUDA context for the entire process and forces a Python restart (see huggingface#12656). Check the bounds up front and raise a ValueError with a clear message about the largest supported resolution, matching how PatchEmbed.cropped_pos_embed in models/embeddings.py handles the same situation for SD3. Fixes huggingface#12656

github-actions Bot added size/S PR with diff < 50 LOC models fixes-issue and removed size/S PR with diff < 50 LOC labels May 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Raise ValueError instead of tearing down CUDA when AuraFlow latents exceed pos_embed_max_size#13740

Raise ValueError instead of tearing down CUDA when AuraFlow latents exceed pos_embed_max_size#13740
Ricardo-M-L wants to merge 1 commit into
huggingface:mainfrom
Ricardo-M-L:fix-auraflow-pe-bounds

Ricardo-M-L commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Ricardo-M-L commented May 13, 2026

Why

Fix

Verification

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant