build: Upgrade to unsloth 2026.3.3 #604

Merged
vivekkalyan merged 1 commit into main from chore/unsloth-2026-3-3-gc-default
Mar 5, 2026

@vivekkalyan vivekkalyan commented Mar 5, 2026

Upgrade to unsloth==2026.3.3 and unsloth-zoo==2026.3.1

Problem

With newer unsloth versions, certain training runs hit out-of-memory (OOM) errors that did not occur with previous versions.

Solution

With older unsloth (2025.12.9), RL training mode effectively used the model's training defaults: it called model.for_training() without passing any gradient checkpointing argument from the trainer, which kept memory usage safe in ART.

With newer unsloth (2026.x), RL training mode reads the trainer args and passes gradient_checkpointing explicitly into for_training(...). It was reading the TRL default (effectively False), which disabled gradient checkpointing and caused the OOMs.

So we now explicitly set the default for gradient_checkpointing to True.
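
For illustration only, the idea of defaulting gradient_checkpointing to True while still honoring an explicit user value could be sketched as below. The helper name with_gradient_checkpointing_default and the plain-dict representation of trainer args are assumptions made for this sketch; they are not taken from the ART or unsloth code, and the actual change in this PR may be structured differently.

```python
from typing import Any


def with_gradient_checkpointing_default(trainer_args: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of trainer_args with gradient_checkpointing defaulting to True.

    Hypothetical helper for illustration; not part of ART, unsloth, or TRL.
    """
    merged = dict(trainer_args)  # copy so the caller's dict is not mutated
    # Fill in the default only when the key is absent, so an explicit
    # user-supplied value (True or False) is preserved.
    merged.setdefault("gradient_checkpointing", True)
    return merged


# Key absent: the memory-safe default is applied.
defaulted = with_gradient_checkpointing_default({"learning_rate": 5e-6})

# Key present: the explicit opt-out is respected.
explicit_off = with_gradient_checkpointing_default({"gradient_checkpointing": False})
```

Using setdefault rather than unconditional assignment keeps the opt-out path open for runs that have enough memory and prefer to skip recomputation.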

@vivekkalyan vivekkalyan changed the title from "Fix unsloth 2026.3.3 upgrade by forcing gradient checkpointing" to "build: Upgrade to unsloth 2026.3.3" Mar 5, 2026
@vivekkalyan vivekkalyan force-pushed the chore/unsloth-2026-3-3-gc-default branch from 8a09100 to 95f87d2 March 5, 2026 02:18
@vivekkalyan vivekkalyan marked this pull request as ready for review March 5, 2026 02:51
@vivekkalyan vivekkalyan requested a review from Kovbo March 5, 2026 02:51
@vivekkalyan vivekkalyan merged commit 43a6ed0 into main Mar 5, 2026
2 of 3 checks passed
@vivekkalyan vivekkalyan deleted the chore/unsloth-2026-3-3-gc-default branch March 5, 2026 02:54