Skip to content

Fix #474: fix: pass all arguments through gradient checkpoint in BasicTransforme#487

Draft
nandanadileep wants to merge 1 commit into
Stability-AI:mainfrom
nandanadileep:agent/issue-474-fix-pass-all-arguments
Draft

Fix #474: fix: pass all arguments through gradient checkpoint in BasicTransforme#487
nandanadileep wants to merge 1 commit into
Stability-AI:mainfrom
nandanadileep:agent/issue-474-fix-pass-all-arguments

Conversation

@nandanadileep

Copy link
Copy Markdown

Fixes #474

Fixed gradient checkpoint in BasicTransformerBlock.forward() to pass all four arguments (x, context, additional_tokens, n_times_crossframe_attn_in_self) to checkpoint(), preventing silent dropping of additional_tokens and n_times_crossframe_attn_in_self when checkpointing is enabled.

Local test infra unavailable in CI sandbox.


This change was prepared with AI assistance under human direction and review.

…oint in Basic

Signed-off-by: nandanadileep <nandanadileep29@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant