fix: model dtype is not same as lora dtype in FSDP train #183
Open
0hujun wants to merge 2 commits into modelscope:main
Conversation
Contributor
Code Review
This pull request introduces the _ensure_lora_dtype method to align LoRA parameter data types with the base model, ensuring compatibility with FSDP2. Review feedback suggests several improvements, including more robust detection of the base data type to handle mixed precision, narrowing exception handling, and wrapping parameter updates in torch.no_grad() for safety.
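For reference, below is a minimal sketch of what such a helper could look like with the review suggestions applied. It is not the PR's actual implementation: the `lora_` substring check assumes PEFT-style parameter names (e.g. `lora_A`, `lora_B`), and picking the most common non-LoRA dtype is one possible way to handle mixed precision.

```python
import logging
from collections import Counter

import torch
import torch.nn as nn

logger = logging.getLogger(__name__)


def _ensure_lora_dtype(model: nn.Module) -> None:
    """Cast LoRA parameters to the base model's dtype so FSDP sees a
    uniform original parameter dtype."""
    # Collect dtypes of base (non-LoRA) parameters; PEFT names LoRA
    # parameters with a "lora_" substring (e.g. lora_A / lora_B).
    base_dtypes = Counter(
        p.dtype for n, p in model.named_parameters() if 'lora_' not in n)
    if not base_dtypes:
        return
    # The most common base dtype is a robust choice under mixed precision.
    base_dtype = base_dtypes.most_common(1)[0][0]

    # Cast in place without recording the casts in autograd.
    with torch.no_grad():
        for name, param in model.named_parameters():
            if 'lora_' in name and param.dtype != base_dtype:
                logger.info('Casting %s from %s to %s', name, param.dtype, base_dtype)
                param.data = param.data.to(base_dtype)
```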
PR type
PR information
As described in #182, on Ascend NPU the model parameter dtype is bf16, but the LoRA parameters are created as fp32 by default, which raises: AssertionError: FSDP expects uniform original parameter dtype but got {torch.bfloat16, torch.float32}
So, after the LoRA parameters are created, convert them all to the base model's dtype.
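For illustration, a hedged sketch of where such a cast would sit in a training script before FSDP wrapping. The model name, LoRA config, and `lora_` name check are placeholder assumptions, not this repository's actual code:

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Placeholder model/config: load the base model in bf16, then attach LoRA.
base = AutoModelForCausalLM.from_pretrained('Qwen/Qwen2-7B', torch_dtype=torch.bfloat16)
model = get_peft_model(base, LoraConfig(r=8, target_modules=['q_proj', 'v_proj']))

# After the LoRA parameters exist, cast them to the base dtype so that
# FSDP later sees a single uniform original parameter dtype.
base_dtype = next(p.dtype for n, p in model.named_parameters() if 'lora_' not in n)
with torch.no_grad():
    for name, param in model.named_parameters():
        if 'lora_' in name and param.dtype != base_dtype:
            param.data = param.data.to(base_dtype)
```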
Experiment results
Training runs fine as usual.