feat: add MiniMax M2.7 support by jQizhang · Pull Request #2685 · NVIDIA-NeMo/RL

jQizhang · 2026-06-04T14:18:07Z

What does this PR do ?

Adds MiniMax M2.7 support for the Automodel DAPO/GRPO workflow.
Related issue: #2251

Summary of code changes:

nemo_rl/models/generation/vllm/quantization/fp8.py - update the vLLM FP8 Ray executor import path used by the local FP8 patching hook.
nemo_rl/models/policy/workers/dtensor_policy_worker_v2.py - call the optional Automodel MoE gate-bias update hook after each optimizer step.
examples/configs/recipes/llm/grpo-minimax-m27-dapo-8n8g-automodel.yaml - add an 8-node, 8-GPU MiniMax M2.7 DAPO recipe using Automodel, vLLM FP8 rollouts, and the DAPOMath datasets.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

copy-pr-bot · 2026-06-04T14:18:15Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

sharonyu-115 · 2026-06-04T14:22:18Z

/ok to test cb0ff8c

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

jQizhang · 2026-06-07T11:47:39Z

/ok to test 05caacb

copy-pr-bot · 2026-06-07T11:47:42Z

/ok to test 05caacb

@jQizhang, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

jQizhang · 2026-06-07T12:25:07Z

/ok to test 8cf2fa6

jQizhang added 6 commits May 31, 2026 21:08

fix(vllm): update MiniMax FP8 support

c2f87ad

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

Merge remote-tracking branch 'origin/main' into minimax-m2-rebase

73aadc0

Merge remote-tracking branch 'origin/main' into minimax-m2-rebase

523298b

fix(policy): update MoE gate bias after optimizer step

bdc2bd7

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

add MiniMax M2.7 DAPO recipe test

05caacb

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

update MiniMax M2.7 DAPO recipe

cb0ff8c

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

jQizhang requested review from a team as code owners June 4, 2026 14:18

jQizhang requested a review from sharonyu-115 June 4, 2026 14:18

sharonyu-115 added the CI:L1 Run doctests, unit tests, and functional tests label Jun 4, 2026

copy-pr-bot Bot temporarily deployed to public June 4, 2026 14:22 Inactive

copy-pr-bot Bot temporarily deployed to public June 4, 2026 14:23 Inactive

copy-pr-bot Bot temporarily deployed to public June 4, 2026 14:25 Inactive

jQizhang added 3 commits June 5, 2026 07:35

Merge remote-tracking branch 'origin/main' into minimax-m2-rebase

68b32ed

update minimax recipe run name

a139bef

Signed-off-by: larkzhang-nv <larkz@nvidia.com>

Merge branch 'main' into minimax-m2-rebase

8cf2fa6

copy-pr-bot Bot temporarily deployed to public June 7, 2026 12:25 Inactive

copy-pr-bot Bot temporarily deployed to public June 7, 2026 12:29 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add MiniMax M2.7 support#2685

feat: add MiniMax M2.7 support#2685
jQizhang wants to merge 9 commits into
mainfrom
minimax-m2-rebase

jQizhang commented Jun 4, 2026

Uh oh!

copy-pr-bot Bot commented Jun 4, 2026

Uh oh!

sharonyu-115 commented Jun 4, 2026

Uh oh!

jQizhang commented Jun 7, 2026

Uh oh!

copy-pr-bot Bot commented Jun 7, 2026

Uh oh!

jQizhang commented Jun 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jQizhang commented Jun 4, 2026

What does this PR do ?

Before your PR is "Ready for review"

Uh oh!

copy-pr-bot Bot commented Jun 4, 2026

Uh oh!

sharonyu-115 commented Jun 4, 2026

Uh oh!

jQizhang commented Jun 7, 2026

Uh oh!

copy-pr-bot Bot commented Jun 7, 2026

Uh oh!

jQizhang commented Jun 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants