Code recipes train models for code generation tasks with M2PO.
Run one example from the repo root:
bash examples/code/qwen3-8b-m2po-delta/scripts/run_qwen3-8b-m2po-delta.sh
Complete guidance: docs/en/recipes/code.md.
GPU Resources
These recipes default to one 8xH100 node.