-
Notifications
You must be signed in to change notification settings - Fork 415
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ci: Bump Megatron-Bridge to 823b951
CI:L1
Run doctests, unit tests, and functional tests
#2735
opened Jun 7, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
fix: fix grpo-gptoss-20b-8n8g-megatron
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2734
opened Jun 7, 2026 by
yuki-97
Contributor
Loading…
[TRAIN-8] Add end-to-end custom reward functions tutorial
community-request
Documentation
Improvements or additions to documentation
#2733
opened Jun 7, 2026 by
mkcash
Loading…
fix: fix qwen3-235b deepseek-v3 h100 perf tests
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2703
opened Jun 5, 2026 by
yuki-97
Contributor
Loading…
feat(algorithms): SingleController streaming train_pump (split-API consumer)
CI:L0
Run doctests and unit tests
#2700
opened Jun 5, 2026 by
mehraakash
Loading…
2 of 3 tasks
feat(policy): split-API train-step state machine on DTensor v1/v2
CI:L0
Run doctests and unit tests
#2692
opened Jun 4, 2026 by
mehraakash
Loading…
3 tasks
chore: bump Gym workspace to v0.3.0
CI:L1
Run doctests, unit tests, and functional tests
#2691
opened Jun 4, 2026 by
kajalj22
Contributor
Loading…
2 tasks
test: guard the MXFP8 flashinfer trtllm-gen MoE fast path from silent regressions
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2689
opened Jun 4, 2026 by
puririshi98
Loading…
feat: add MiniMax M2.7 support
CI:L1
Run doctests, unit tests, and functional tests
#2685
opened Jun 4, 2026 by
jQizhang
Contributor
Loading…
3 of 4 tasks
feat(megatron): split-API train-step state machine on MegatronPolicyWorker
CI:L0
Run doctests and unit tests
#2683
opened Jun 4, 2026 by
mehraakash
Loading…
4 tasks
feat: GDPO vs GRPO multi-reward example, collapse demo, and experiment skill
#2681
opened Jun 3, 2026 by
anjalibshah
Loading…
feat: configurable GDPO per-reward weights and multi-reward NeMo Gym bridge
Documentation
Improvements or additions to documentation
#2680
opened Jun 3, 2026 by
anjalibshah
Loading…
ci: bump _release_library.yml to v1.4.3
CI
Relating to CI
#2678
opened Jun 3, 2026 by
ko3n1g
Contributor
Loading…
feat: Enable tqdm configuration for vllm generation
community-request
waiting-on-customer
Waiting on the original author to respond
#2677
opened Jun 3, 2026 by
louisfaury
Loading…
2 of 4 tasks
chore: bump Relating to CI
_code_freeze workflow to v1.4.2
CI
#2675
opened Jun 3, 2026 by
ko3n1g
Contributor
Loading…
feat: Add script to re-initialize near-zero HF embeddings
Documentation
Improvements or additions to documentation
#2671
opened Jun 2, 2026 by
ashors1
Contributor
Loading…
4 tasks
fix: resolve qwen3.5-35ba3b megatron ep16 OOM via TP=2 (#2619)
#2668
opened Jun 2, 2026 by
sharonyu-115
Contributor
Loading…
4 tasks
chore: bump transfomrers 5.5
CI:L1
Run doctests, unit tests, and functional tests
#2667
opened Jun 2, 2026 by
yuekaizhang
Contributor
Loading…
test(converters): add CLI entry-point coverage for all three converte…
community-request
waiting-on-customer
Waiting on the original author to respond
#2666
opened Jun 2, 2026 by
SakethKoona
Loading…
3 of 4 tasks
fix(security): bump deps for CVE remediation (June 2026)
CI:L1
Run doctests, unit tests, and functional tests
fix(nrl-k8s): remove SA impersonation from dev pod RBAC check
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2655
opened Jun 1, 2026 by
terrykong
Collaborator
Loading…
2 tasks done
refactor: refactor generation config
CI:L1
Run doctests, unit tests, and functional tests
Documentation
Improvements or additions to documentation
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.