Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Adding adapter changes needed for gemma4 inference.
#3799 opened May 2, 2026 by NicoGrande Collaborator Draft
4 tasks
Internal.
#3795 opened May 1, 2026 by copybara-service Bot Loading…
Add weekly linkchecker and report failures on issue
#3792 opened May 1, 2026 by melissawm Collaborator Loading…
3 tasks done
[WIP] Multimodal quality benchmark
#3784 opened Apr 30, 2026 by hengtaoguo Collaborator Draft
4 tasks
Exclude RNG state when updating vllm with checkpoint for pre-rl evaluation
#3782 opened Apr 30, 2026 by SurbhiJainUSC Collaborator Loading…
4 tasks done
Feat/Support-Qwix-Quantization-For-NNX-Models
#3781 opened Apr 30, 2026 by hsuan-lun-chiang Collaborator Draft
4 tasks
Support specifying tokamax gmm tile sizes in MaxText
#3779 opened Apr 29, 2026 by darisoy Collaborator Loading…
4 tasks done
Reorganize pre-training doc
#3778 opened Apr 29, 2026 by melissawm Collaborator Loading…
2 tasks done
Add unit tests for evaluate_rl.py
#3776 opened Apr 29, 2026 by SurbhiJainUSC Collaborator Loading…
4 tasks done
Remove local sort after ragged all-to-all
#3774 opened Apr 29, 2026 by copybara-service Bot Loading…
[NNX] NNX migration prep (6/N): NNX-native DPO
#3773 opened Apr 29, 2026 by ecnal-cienet Collaborator Draft
4 tasks done
[Distillation] Layer-wise LTI gemini-review
#3769 opened Apr 29, 2026 by vlad-karp Collaborator Loading…
4 tasks done
CI/CD UT optimization
#3753 opened Apr 27, 2026 by charlesli640 Collaborator Loading…
4 tasks done
Add dataset type olmo_grain for AI2 OLMo numpy pretrain mixes gemini-review
#3749 opened Apr 26, 2026 by gagika Collaborator Loading…
4 tasks done
Internal change
#3743 opened Apr 24, 2026 by copybara-service Bot Loading…
Replace hardcoded bucket names with generic ones/env variables
#3738 opened Apr 24, 2026 by melissawm Collaborator Loading…
1 task done
Add fused_moe_mlp: fuse wi_0 and wi_1 into one grouped GEMM for MoE FFN1
#3736 opened Apr 23, 2026 by abhinavgoel95 Contributor Loading…
4 tasks done
Add support for flexible MTP layer architecture in MaxText.
#3734 opened Apr 23, 2026 by parambole Collaborator Draft
4 tasks
Replace references to model_creation_utils.create_nnx_model.
#3720 opened Apr 22, 2026 by copybara-service Bot Loading…
4 tasks done
[New Model Bringup] Initial Commit to enable Text-only architecture for Qwen3.5
#3712 opened Apr 21, 2026 by Rohan-Bierneni Collaborator Loading…
4 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.