Pull requests: NVIDIA-NeMo/RL

Pull requests list

ci: Bump Megatron-Bridge to 823b951 CI:L1 Run doctests, unit tests, and functional tests
#2735 opened Jun 7, 2026 by svcnvidia-nemo-ci Contributor
fix: fix grpo-gptoss-20b-8n8g-megatron CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2734 opened Jun 7, 2026 by yuki-97 Contributor
[TRAIN-8] Add end-to-end custom reward functions tutorial community-request Documentation Improvements or additions to documentation
#2733 opened Jun 7, 2026 by mkcash
fix: fix qwen3-235b deepseek-v3 h100 perf tests CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2703 opened Jun 5, 2026 by yuki-97 Contributor
feat(algorithms): SingleController streaming train_pump (split-API consumer) CI:L0 Run doctests and unit tests
#2700 opened Jun 5, 2026 by mehraakash
2 of 3 tasks
feat(policy): split-API train-step state machine on DTensor v1/v2 CI:L0 Run doctests and unit tests
#2692 opened Jun 4, 2026 by mehraakash
3 tasks
chore: bump Gym workspace to v0.3.0 CI:L1 Run doctests, unit tests, and functional tests
#2691 opened Jun 4, 2026 by kajalj22 Contributor
2 tasks
feat: add MiniMax M2.7 support CI:L1 Run doctests, unit tests, and functional tests
#2685 opened Jun 4, 2026 by jQizhang Contributor
3 of 4 tasks
feat(megatron): split-API train-step state machine on MegatronPolicyWorker CI:L0 Run doctests and unit tests
#2683 opened Jun 4, 2026 by mehraakash
4 tasks
feat: configurable GDPO per-reward weights and multi-reward NeMo Gym bridge Documentation Improvements or additions to documentation
#2680 opened Jun 3, 2026 by anjalibshah
ci: bump _release_library.yml to v1.4.3 CI Relating to CI
#2678 opened Jun 3, 2026 by ko3n1g Contributor
feat: Enable tqdm configuration for vllm generation community-request waiting-on-customer Waiting on the original author to respond
#2677 opened Jun 3, 2026 by louisfaury
2 of 4 tasks
chore: bump _code_freeze workflow to v1.4.2 CI Relating to CI
#2675 opened Jun 3, 2026 by ko3n1g Contributor
Add nano_v3 vLLM reasoning parser plugin
#2673 opened Jun 2, 2026 by dpickem
4 tasks
feat: Add script to re-initialize near-zero HF embeddings Documentation Improvements or additions to documentation
#2671 opened Jun 2, 2026 by ashors1 Contributor
4 tasks
fix: Fix fp8 memory fragmentation
#2670 opened Jun 2, 2026 by ashors1 Contributor
4 tasks
fix: resolve qwen3.5-35ba3b megatron ep16 OOM via TP=2 (#2619)
#2668 opened Jun 2, 2026 by sharonyu-115 Contributor
4 tasks
chore: bump transformers 5.5 CI:L1 Run doctests, unit tests, and functional tests
#2667 opened Jun 2, 2026 by yuekaizhang Contributor
test(converters): add CLI entry-point coverage for all three converte… community-request waiting-on-customer Waiting on the original author to respond
#2666 opened Jun 2, 2026 by SakethKoona
3 of 4 tasks
fix(security): bump deps for CVE remediation (June 2026) CI:L1 Run doctests, unit tests, and functional tests
#2663 opened Jun 2, 2026 by kajalj22 Contributor Draft
4 tasks
fix(nrl-k8s): remove SA impersonation from dev pod RBAC check CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2655 opened Jun 1, 2026 by terrykong Collaborator
2 tasks done
refactor: refactor generation config CI:L1 Run doctests, unit tests, and functional tests Documentation Improvements or additions to documentation
#2653 opened Jun 1, 2026 by yuki-97 Contributor Draft