Add llm_methods metadata to multimethod PTE export by kimishpatel · Pull Request #18773 · pytorch/executorch

kimishpatel · 2026-04-08T14:30:04Z

Summary:
Add phase-aware method metadata to PTE files during multimethod export.
When MethodConfig entries have a phase tag ("prefill" or "decode"), the
export pipeline writes llm_methods_prefill and llm_methods_decode
constant methods into the PTE. This enables the runtime to discover which
methods to call for each inference phase without hardcoding names.

Changes:

MethodConfig: add optional phase field
_build_yoco_multimethod_config: tag methods with phase="prefill"/"decode"
_export_llm_backbone_multimethod: collect phase-tagged method names
_lower_and_save_multimethod: accept extra_metadata parameter
constants.h: add kLlmMethodsPrefill/kLlmMethodsDecode keys

Differential Revision: D99689421

Summary: Add phase-aware method metadata to PTE files during multimethod export. When MethodConfig entries have a `phase` tag ("prefill" or "decode"), the export pipeline writes `llm_methods_prefill` and `llm_methods_decode` constant methods into the PTE. This enables the runtime to discover which methods to call for each inference phase without hardcoding names. Changes: - MethodConfig: add optional `phase` field - _build_yoco_multimethod_config: tag methods with phase="prefill"/"decode" - _export_llm_backbone_multimethod: collect phase-tagged method names - _lower_and_save_multimethod: accept extra_metadata parameter - constants.h: add kLlmMethodsPrefill/kLlmMethodsDecode keys Differential Revision: D99689421

pytorch-bot · 2026-04-08T14:30:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18773

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 15 New Failures, 2 Unrelated Failures

As of commit ce726f6 with merge base e638059 ():

NEW FAILURES - The following jobs have failed:

Cadence Build & Test / cpu-test / test-aot / test-aot (gh)
backends/cadence/aot/tests/test_replace_ops_passes.py::TestReplaceOpsPasses::test_replace_conv2d_with_linear
MLX / test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, false, 4w) / test-mlx-llm-gemma3-1b-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, false, nvfp4) / test-mlx-llm-gemma3-1b-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, true, 4w) / test-mlx-llm-gemma3-1b-custom-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, true, nvfp4) / test-mlx-llm-gemma3-1b-custom-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, false, 4w) / test-mlx-llm-llama-1b-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, false, nvfp4) / test-mlx-llm-llama-1b-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, true, 4w) / test-mlx-llm-llama-1b-custom-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, true, nvfp4) / test-mlx-llm-llama-1b-custom-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, false, 4w) / test-mlx-llm-qwen3-0.6b-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, false, nvfp4) / test-mlx-llm-qwen3-0.6b-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, true, 4w) / test-mlx-llm-qwen3-0.6b-custom-4w (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, true, nvfp4) / test-mlx-llm-qwen3-0.6b-custom-nvfp4 (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2
MLX / test-mlx-voxtral-realtime / test-mlx-voxtral-realtime (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 1
MLX / test-mlx-whisper / test-mlx-whisper (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 2

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-04-08T14:30:14Z

@kimishpatel has exported this pull request. If you are a Meta employee, you can view the originating Diff in D99689421.

github-actions · 2026-04-08T14:30:59Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

kimishpatel requested review from larryliu0820 and mergennachin as code owners April 8, 2026 14:30

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 8, 2026

meta-codesync bot added fb-exported meta-exported labels Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add llm_methods metadata to multimethod PTE export#18773

Add llm_methods metadata to multimethod PTE export#18773
kimishpatel wants to merge 1 commit intopytorch:mainfrom
kimishpatel:export-D99689421

kimishpatel commented Apr 8, 2026

Uh oh!

pytorch-bot bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

meta-codesync bot commented Apr 8, 2026

Uh oh!

github-actions bot commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

kimishpatel commented Apr 8, 2026

Uh oh!

pytorch-bot bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18773

❌ 15 New Failures, 2 Unrelated Failures

Uh oh!

meta-codesync bot commented Apr 8, 2026

Uh oh!

github-actions bot commented Apr 8, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pytorch-bot bot commented Apr 8, 2026 •

edited

Loading

This PR needs a `release notes:` label