[WIP] Update `diffusers-cli` for agentic use by DN6 · Pull Request #13966 · huggingface/diffusers

DN6 · 2026-06-15T12:31:47Z

What does this PR do?

Some updates to the diffusers-cli to make it more agent friendly. This PR

Adds a diffusers-cli skill to showcase the features available via the CLI and how to use them
Adds a describe command that can we used to extract the inputs of a pipeline from an input repo id
Adds a generate command that runs inference with any diffusers compatible pipelines. It also provides a number of optimization options (CP, cpu/group offload) + LoRA and allows running inference remotely on HF jobs.

Fixes # (issue)

Before submitting

Did you use an AI agent (Claude Code, Codex, Cursor, etc.) to help with this PR? If so:
- Did you point it at the project conventions in .ai/ (e.g. via make claude / make codex)? See Coding with AI agents.
- Did you self-review the diff against .ai/review-rules.md?
Did you read the contributor guideline?
Did you read our philosophy doc? (important for complex PRs)
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2026-06-15T12:41:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2026-06-16T05:01:05Z

+
+- Multi-stage workflows where you need intermediate tensor manipulation between pipelines → write Python.
+- Training or fine-tuning → CLI only covers inference.
+- Anything requiring custom `device_map`, `quantization_config`, or other low-level loader knobs not exposed by


Feels like quantization could be exposed to the CLI. Right now, one can only do that when using a prequantized checkpoint?

Quantization has a fairly large API surface that might be better suited to writing a dedicated quantization script? e.g BnB quant config options have no overlap with TorchAO which in turn have no overlap with ModelOpt etc etc. TorchAO also supports using AOBaseConfig input which in turn has it's own input args.

We could explore trying to provide the option via a more restricted API though.

No your reasoning makes sense. It's just that a user could expect it because quantization is sometimes the only way to do it locally. We can table it for now.

sayakpaul · 2026-06-16T05:21:16Z

+    parser.add_argument("--vae-tiling", action="store_true", help="Enable VAE tiling (lower peak VRAM).")
+    parser.add_argument("--vae-slicing", action="store_true", help="Enable VAE slicing (lower peak VRAM).")
+    parser.add_argument(
+        "--context-parallel",


How does it interact with --remote?

I'm not sure I follow?

How --context-parallel interact with --remote? Like do we want the users to run context parallel inference in case HF Jobs don't support it? Or do we want to just delegate to HF Jobs and propagate if there are errors?

sayakpaul · 2026-06-16T05:48:58Z

Generated the following with the CLI:

diffusers-cli generate -m black-forest-labs/FLUX.1-dev \
  --device cuda --dtype bf16 --seed 42 -o outputs/dog_moon.png \
  --pipeline-kwargs '{"prompt":"realistic photo of a dog walking down the surface of moon","guidance_scale":3.5,"num_inference_steps":50}'

Nice little summary:

generate
  task: generate
  model: black-forest-labs/FLUX.1-dev
  device: cuda
  pipeline_class: FluxPipeline
  modular: False
  outputs: ['outputs/dog_moon.png']
  seed: 42

Final output:

I think we could also add lightweight testing around these things just to ensure consistency and that the right inputs are being passed.

github-actions · 2026-06-16T09:19:26Z

Hi @DN6, thanks for the PR! It does not appear to link an issue it fixes. If this PR addresses an existing issue, please add a closing keyword (e.g. Fixes #1234) to the PR description so the issue is linked. See the contribution guide for more details. If this PR intentionally does not fix a tracked issue, a maintainer can add the no-issue-needed label to silence this reminder.

DN6 added 24 commits June 1, 2026 23:55

update

e84a3ef

update

59be753

update

4194c39

update

d8eb952

update

95f33c7

update

accfa06

update

4d4d9e8

update

f97aef8

update

3774951

update

add747b

update

934b557

update

0ae1eb0

update

dcfd09c

Merge remote-tracking branch 'origin' into diffuser-cli-for-agent

2221383

update

9515c55

update

404be8a

update

f3fa589

update

633461d

update

268bae9

update

fa7a0a2

update

55e1c14

update

6ba7a3f

update

6f02aed

update

889f646

github-actions Bot added size/L PR with diff > 200 LOC utils labels Jun 15, 2026

sayakpaul reviewed Jun 16, 2026

View reviewed changes

pdate

ab70d69

DN6 and others added 4 commits June 16, 2026 16:33

update

af8cbf4

update

b50dae1

update

1d6f5b3

Merge branch 'main' into diffuser-cli-for-agent

46849ae

Conversation

DN6 commented Jun 15, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Jun 15, 2026

Uh oh!

Uh oh!

Uh oh!

sayakpaul Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

DN6 Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sayakpaul Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

DN6 Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sayakpaul commented Jun 16, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants