Skip to content

[LTX2] Add LTX2 Audio VAE#348

Merged
copybara-service[bot] merged 1 commit intomainfrom
ltx2_audio_vae
Mar 13, 2026
Merged

[LTX2] Add LTX2 Audio VAE#348
copybara-service[bot] merged 1 commit intomainfrom
ltx2_audio_vae

Conversation

@Perseus14
Copy link
Collaborator

@Perseus14 Perseus14 commented Mar 6, 2026

This PR introduces the FlaxAutoencoderKLLTX2Audio model, providing a complete Flax/NNX implementation of the Audio VAE required for the LTX-2 pipeline. It supports encoding and decoding of audio spectrograms into latent representations, featuring fully configurable causal convolutions, dynamic spatial/temporal scaling, and strict parameter alignment for upstream checkpoint compatibility.

@Perseus14 Perseus14 requested a review from entrpn as a code owner March 6, 2026 12:24
@github-actions
Copy link

github-actions bot commented Mar 6, 2026

@Perseus14 Perseus14 changed the title Add LTX2 Audio VAE [LTX2] Add LTX2 Audio VAE Mar 6, 2026
@Perseus14
Copy link
Collaborator Author

@mbohlool PTAL

@Perseus14 Perseus14 requested a review from mbohlool March 12, 2026 20:19
mbohlool
mbohlool previously approved these changes Mar 13, 2026
Copy link
Collaborator

@mbohlool mbohlool left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approving to unblock you after you added the license header.

@Perseus14 Perseus14 requested a review from mbohlool March 13, 2026 10:24
@copybara-service copybara-service bot merged commit f08341e into main Mar 13, 2026
31 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants