[Discrete Diffusion] Add LLaDA2 pipeline by kashif · Pull Request #13226 · huggingface/diffusers

kashif · 2026-03-08T17:44:13Z

Add support for LLaDA2/LLaDA2.1 discrete diffusion text generation:

- BlockRefinementPipeline: block-wise iterative refinement with confidence-based token commitment, supporting an editing threshold for LLaDA2.1 models
- LLaDA2Pipeline: convenience wrapper with LLaDA2-specific defaults
- DiscreteDiffusionPipelineMixin: shared SAR sampling utilities (top-k, top-p, temperature) and prompt/prefix helpers
- compute_confidence_aware_loss: CAP-style training loss
- Examples: sampling scripts for LLaDA2 and block refinement, training scripts with Qwen causal LM
- Docs and tests included
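
For readers unfamiliar with the sampling knobs named above (top-k, top-p, temperature), here is a minimal pure-Python sketch of how such filters are commonly combined. It is illustrative only and is not taken from DiscreteDiffusionPipelineMixin: the function names `filter_logits` and `sample_token` are hypothetical, and the actual utilities in this PR presumably operate on batched tensors.

```python
import math
import random


def filter_logits(logits, temperature=1.0, top_k=None, top_p=None):
    """Return per-token probabilities after temperature, top-k and top-p filtering.

    Hypothetical helper for illustration; not the API added by this PR.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if top_k is not None and top_k < 1:
        raise ValueError("top_k must be at least 1")

    # Temperature scaling followed by a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Token ids ordered from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    keep = set(order)
    if top_k is not None:
        keep &= set(order[:top_k])
    if top_p is not None:
        # Nucleus: smallest prefix whose cumulative mass reaches top_p.
        cumulative, nucleus = 0.0, []
        for i in order:
            nucleus.append(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
        keep &= set(nucleus)

    # Renormalize over the surviving tokens; filtered tokens get probability 0.
    kept_mass = sum(probs[i] for i in keep)
    return [probs[i] / kept_mass if i in keep else 0.0 for i in range(len(probs))]


def sample_token(logits, rng, **kwargs):
    """Sample one token id and return it with its (filtered) probability."""
    probs = filter_logits(logits, **kwargs)
    token = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return token, probs[token]
```

The probability returned alongside the sampled token is the kind of per-position confidence score that a confidence-based commit rule can rank on.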



Extract the confidence-based token commit logic from BlockRefinementPipeline into a dedicated BlockRefinementScheduler, following diffusers conventions.

The scheduler owns:
- Transfer schedule computation (get_num_transfer_tokens)
- Timestep management (set_timesteps)
- Step logic: confidence-based mask-filling and optional token editing

The pipeline now delegates scheduling to self.scheduler.step() and accepts a scheduler parameter in __init__.
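
The scheduler responsibilities listed above can be sketched in plain Python. This is an assumption-laden illustration, not the code in this PR: only the name get_num_transfer_tokens comes from the commit message, while its signature, the `step` signature, the `MASK` sentinel, the even-split schedule, and the exact threshold and editing semantics are all guesses made for the example.

```python
MASK = -1  # hypothetical mask token id, chosen for the example only


def get_num_transfer_tokens(block_length, num_steps):
    """Spread block_length commits over num_steps; early steps absorb the remainder.

    Assumed schedule (an even split); the PR's schedule may differ.
    """
    base, remainder = divmod(block_length, num_steps)
    return [base + (1 if i < remainder else 0) for i in range(num_steps)]


def step(tokens, proposals, confidences, num_to_commit,
         threshold=None, editing_threshold=None, prompt_len=0):
    """One refinement step over a flat list of token ids.

    - Masked positions: commit the num_to_commit most confident proposals,
      plus (if threshold is set) every proposal at or above the threshold.
    - Committed positions after the prompt: if editing_threshold is set,
      overwrite a token when the model proposes a different one with
      confidence at or above editing_threshold.
    - Prompt positions (index < prompt_len) are never modified.
    """
    masked = [i for i in range(prompt_len, len(tokens)) if tokens[i] == MASK]
    ranked = sorted(masked, key=lambda i: confidences[i], reverse=True)
    chosen = set(ranked[:num_to_commit])
    if threshold is not None:
        chosen |= {i for i in masked if confidences[i] >= threshold}

    if editing_threshold is not None:
        for i in range(prompt_len, len(tokens)):
            already_committed = tokens[i] != MASK
            if (already_committed and proposals[i] != tokens[i]
                    and confidences[i] >= editing_threshold):
                chosen.add(i)

    return [proposals[i] if i in chosen else t for i, t in enumerate(tokens)]
```

Under these assumptions, a pipeline loop would call `get_num_transfer_tokens` once per block and then pass each step's budget to `step`, which mirrors the delegation to self.scheduler.step() described in the commit message.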


kashif added 3 commits March 8, 2026 17:43

test: add unit tests for BlockRefinementScheduler

f8220db

12 tests covering set_timesteps, get_num_transfer_tokens, step logic (confidence-based commits, threshold behavior, editing, prompt masking, batched inputs, tuple output).

kashif force-pushed the llada2-support branch from 6718843 to f8220db Compare March 8, 2026 18:22

docs: add toctree entries and standalone scheduler doc page

bbc3592

- Add BlockRefinement and LLaDA2 to docs sidebar navigation
- Add BlockRefinementScheduler to schedulers sidebar navigation
- Move scheduler autodoc to its own page under api/schedulers/

kashif changed the title ~~[Discrete Diffusion] Add LLaDA pipeline~~ [Discrete Diffusion] Add LLaDA2 pipeline Mar 8, 2026

kashif mentioned this pull request Mar 8, 2026

Discrete diffusion in diffusers #12911 (Draft, 6 tasks)

kashif requested a review from dg845 March 8, 2026 19:55

kashif wants to merge 4 commits into huggingface:main from kashif:llada2-support
