[ExecuTorch][WebGPU] Add aten.index.Tensor (1D-self gather) by JulianCloudNTH · Pull Request #20461 · pytorch/executorch

JulianCloudNTH · 2026-06-23T20:34:36Z

Stack from ghstack (oldest at bottom):

(to be filled)

Adds the WebGPU delegate handler for aten.index.Tensor, the 1D-self advanced-index
gather out[i] = self[index[i]] (output shape == index shape). This is the form the
VulkanPartitioner delegates -- it requires a 1D self and exactly one non-None index
(op_registry.py); 2D mask/freqs gathers stay on CPU. It mirrors the Vulkan delegate's
index_tensor op (IndexTensor.cpp + index_tensor_buffer.glsl) as a single compute
dispatch over the output elements, each reading the int32 index and gathering the
corresponding fp32 self element.

The op is composed as:

index.wgsl: one workgroup-strided pass, out[i] = self[u32(index[i])], guarded by a
numel bound; buffer-only, fp32 self/out, int32 index, 1D dispatch via the shared
WebGPUUtils helpers (clamp workgroup size + 1D count).
Index.cpp: validates the args (self/out tensors; indices ValueList with exactly one
index tensor; fp32 self/out; int32 index; out numel == index numel), failing loud on
any violation, then records the dispatch. row_width is dropped (always 1 for 1D self).

Differential Revision: D109478967

[ghstack-poisoned]

pytorch-bot · 2026-06-23T20:34:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20461

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[ROCm] MI350 CI jobs will have longer queue times due to CI migration

❌ 6 New Failures, 1 Pending, 1 Unrelated Failure

As of commit 80e6fb0 with merge base 1b726b2 ():

NEW FAILURES - The following jobs have failed:

pull / test-arm-backend-no-driver (test_pytest_ops_tosa) / linux-job (gh)
RuntimeError: Command docker exec -t 6aafab84dd82b4ad1708290153820c460df540ae87154499c20e4f3228e54948 /exec failed with exit code 1
pull / test-llama-runner-qnn-linux (fp32, qnn_16a16w, qnn) / linux-job (gh)
RuntimeError: Command docker exec -t 6b1694091246206179ef2ce31c0e8259f5d90d8c8753680eafac8e8b7a621a57 /exec failed with exit code 137
pull / unittest / linux / linux-job (gh)
RuntimeError: Command docker exec -t 7231e7aa96da0f5afb331d053510865d51263dc28f4515ed39411d21c184df28 /exec failed with exit code 1
pull / unittest-editable / linux / linux-job (gh)
RuntimeError: Command docker exec -t 9c6c6673b752592e2dd91387f25cf3c659e1422b68261b020452a797bf4b36bc /exec failed with exit code 1
pull / unittest-editable / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1
Test QNN Backend / test-qnn / test-backend-linux (qnn, operators) / linux-job (gh)
RuntimeError: Command docker exec -t 41e429dee959f4d46f0dcc15db994fd88870d1727efcbfaa7ad15737789c65d8 /exec failed with exit code 92

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / macos / macos-job (gh) (trunk failure)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-06-23T20:35:31Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Update

80e6fb0

[ghstack-poisoned]

JulianCloudNTH requested review from kirklandsign and larryliu0820 as code owners June 23, 2026 20:34

JulianCloudNTH temporarily deployed to cadence June 23, 2026 20:34 — with GitHub Actions Inactive

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ExecuTorch][WebGPU] Add aten.index.Tensor (1D-self gather)#20461

[ExecuTorch][WebGPU] Add aten.index.Tensor (1D-self gather)#20461
JulianCloudNTH wants to merge 1 commit into
gh/JulianCloudNTH/56/basefrom
gh/JulianCloudNTH/56/head

JulianCloudNTH commented Jun 23, 2026

Uh oh!

pytorch-bot Bot commented Jun 23, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

JulianCloudNTH commented Jun 23, 2026

Uh oh!

pytorch-bot Bot commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20461

❗ 1 Active SEVs

❌ 6 New Failures, 1 Pending, 1 Unrelated Failure

Uh oh!

github-actions Bot commented Jun 23, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pytorch-bot Bot commented Jun 23, 2026 •

edited

Loading

This PR needs a `release notes:` label