Add Auto-FL report skill by holgerroth · Pull Request #4845 · NVIDIA/NVFlare

holgerroth · 2026-06-30T18:07:16Z

Summary

Adds a productized nvflare-autofl-report companion skill for generating reproducible final artifacts after an Auto-FL campaign has stopped, reached a cap, hit a hard blocker, or been manually interrupted.

This is a follow-up to #4780 and is intentionally stacked on that branch. The new work is contained in commit 08105a368; once #4780 lands, this PR's diff will collapse to the reporting feature only.

User Experience

After stopping a campaign, the user can ask their coding agent:

Use the NVFlare Auto-FL Report skill.
Generate the final report for the stopped campaign in ./job.

The skill verifies stopped state and deterministically produces:

autofl_final_report.md
autofl_report_summary.json
a refreshed progress.png

The report includes baseline/best results, candidate lineage and inherited code changes, manifests and hashes, exact commands, runtime/failures, literature checkpoints with measured follow-on outcomes, and comparability warnings.

Design

Refuses to finalize state with final_response_allowed=false unless the user explicitly confirms an abrupt interruption after execution is independently checked.
Does not mutate job source, results.tsv, candidate manifests, or campaign state.
Works without Git and does not auto-commit.
Reuses the product Auto-FL progress plotter instead of copying research plotting logic.
Distinguishes the imported autofl.yaml budget from executed baseline/best commands.
Warns when training compute changed or a test-like metric guided repeated candidate selection.
Associates each literature checkpoint with subsequent measured candidates until the next checkpoint and preserves campaign-recorded source identifiers without claiming independent citation verification.

No files under research/auto-fl-research and no H100-specific assets or instructions are changed.

Validation

72 passed across report, Auto-FL runner/guard/plotter, importer, skill admission, and release-bundle tests.
Black, isort, flake8, and git diff --check pass.
Full docs HTML build completes; remaining warnings are pre-existing elsewhere in the docs tree.
Forward-tested against a copied 149-row, 8-client CIFAR-10 campaign ledger: the report reproduced the 0.6870 baseline, 0.8218 best score, 34-candidate retained lineage, 10 literature checkpoints, and the executed local-epoch comparability warning without touching the live campaign.

Dependency

Depends on Add Auto-FL agent skill #4780 (nvflare-autofl, candidate manifests, campaign state, ledger, and progress plotter).

Signed-off-by: Holger Roth <hroth@nvidia.com>

codecov-commenter · 2026-06-30T18:15:35Z

Codecov Report

❌ Patch coverage is 87.20682% with 60 lines in your changes missing coverage. Please review.
✅ Project coverage is 56.68%. Comparing base (7df6a4c) to head (23f6a6e).

Files with missing lines	Patch %	Lines
nvflare/app_common/autofl/job_importer.py	87.04%	60 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4845      +/-   ##
==========================================
+ Coverage   56.49%   56.68%   +0.18%     
==========================================
  Files         969      971       +2     
  Lines       92210    92679     +469     
==========================================
+ Hits        52096    52535     +439     
- Misses      40114    40144      +30

Flag	Coverage Δ
unit-tests	`56.68% <87.20%> (+0.18%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Signed-off-by: Holger Roth <hroth@nvidia.com>

holgerroth and others added 24 commits June 8, 2026 16:00

Add Auto-FL agent skill importer

9b86388

Clarify Auto-FL campaign config role

5e87fa2

Move Auto-FL skill to root skills layout

a29f84f

Merge branch 'main' into codex/autofl-skill-v1

2146a8e

Address Auto-FL importer review feedback

989bf45

Mark unresolved Auto-FL importer names

1d64aa4

Keep Auto-FL skill campaigns running

ead5ff2

Merge upstream main into Auto-FL skill PR

361e639

Fix Auto-FL skill release packaging references

cb2bf31

Address Auto-FL Greptile review findings

46c4d5c

Add Auto-FL skill campaign guard

2e287b4

Report CIFAR Auto-FL score as test accuracy

780fe21

Standardize Auto-FL optimization metric contract

84be7f9

Lower Auto-FL literature watchdog default

4c1706f

Fix uncapped Auto-FL campaign stopping

1480e1b

Signed-off-by: Holger Roth <hroth@nvidia.com>

Merge remote-tracking branch 'upstream/main' into codex/autofl-skill-v1

abf6384

Add Auto-FL campaign progress visual

1782ba5

Signed-off-by: Holger Roth <hroth@nvidia.com>

Remove local Auto-FL validation scaffolding

1a93d94

Signed-off-by: Holger Roth <hroth@nvidia.com>

Make Auto-FL code candidates first-class

3012a2f

Signed-off-by: Holger Roth <hroth@nvidia.com>

Restore Auto-FL workspace on schema errors

28f1a9f

Signed-off-by: Holger Roth <hroth@nvidia.com>

Harden Auto-FL candidate error recovery

0020b39

Signed-off-by: Holger Roth <hroth@nvidia.com>

Make Auto-FL candidate finalization transactional

ebc0c05

Signed-off-by: Holger Roth <hroth@nvidia.com>

Improve Auto-FL campaign progress plots

6a3cd27

Add Auto-FL stopped campaign report skill

08105a3

Signed-off-by: Holger Roth <hroth@nvidia.com>

holgerroth force-pushed the codex/autofl-report-skill branch from 48ca4f7 to 08105a3 Compare June 30, 2026 18:07

holgerroth changed the title ~~Add Auto-FL stopped campaign report skill~~ Add Auto-FL report skill Jul 1, 2026

holgerroth and others added 2 commits July 1, 2026 15:45

Merge branch 'main' into codex/autofl-report-skill

43655ed

Harden Auto-FL final report semantics

23f6a6e

Signed-off-by: Holger Roth <hroth@nvidia.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Auto-FL report skill#4845

Add Auto-FL report skill#4845
holgerroth wants to merge 26 commits into
NVIDIA:mainfrom
holgerroth:codex/autofl-report-skill

holgerroth commented Jun 30, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Jun 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

holgerroth commented Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

User Experience

Design

Validation

Dependency

Uh oh!

codecov-commenter commented Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

holgerroth commented Jun 30, 2026 •

edited

Loading

codecov-commenter commented Jun 30, 2026 •

edited

Loading