Refactor custom_splash_attention for improved clarity by Perseus14 · Pull Request #431 · AI-Hypercomputer/maxdiffusion

Perseus14 · 2026-06-29T06:30:52Z

Overview

This PR refactors and cleans up the custom Splash attention kernels by properly encapsulating the inner-VPU tiling step (block_kv_compute_in) and removing redundant parameters. It also aligns the newly introduced custom Ring Attention implementations with these cleaned-up signatures.

Changes

custom_splash_attention.py:
- Added block_kv_compute_in directly to the _BlockSizes dataclass, streamlining parameter passing.
- Removed unused high-level attention wrappers (tpu_custom_attention, make_custom_splash_sdpa) since orchestration is now fully handled in attention_flax.py.
- Cleaned up _flash_attention_kernel and _splash_attention_forward_ring by stripping out redundant arguments like bq, q_seq_len, and explicit bkv_compute_in passes.
ring_attention_kernel.py:
- Updated the function signatures for make_custom_ring_attention, _custom_ring_attention_forward, and _custom_bidirectional_ring_forward to drop the explicit bkv_compute_in argument, correctly extracting it from block_sizes instead.
attention_flax.py:
- Updated all instantiations of custom_splash._BlockSizes to pass block_kv_compute_in.
- Fixed downstream calls to make_splash_mha and make_custom_ring_attention to remove the now-redundant bkv_compute_in kwarg.
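
As a rough illustration of the pattern described above (not the actual maxdiffusion code), the sketch below shows a block-size dataclass that carries block_kv_compute_in, and a factory that reads it from block_sizes instead of taking a separate bkv_compute_in argument. The names BlockSizes and make_attention, the other fields, and the divisibility check are all assumptions made for the example.

```python
import dataclasses


@dataclasses.dataclass(frozen=True)
class BlockSizes:
  """Hypothetical stand-in for _BlockSizes.

  Only block_kv_compute_in is named in this PR; the other fields and the
  validation below are assumptions made for illustration.
  """
  block_q: int
  block_kv: int
  block_kv_compute: int
  block_kv_compute_in: int  # inner tiling step, carried by the dataclass

  def __post_init__(self):
    # Assumed constraint for this sketch: the inner step tiles the compute block evenly.
    if self.block_kv_compute_in <= 0:
      raise ValueError("block_kv_compute_in must be positive")
    if self.block_kv_compute % self.block_kv_compute_in != 0:
      raise ValueError("block_kv_compute must be a multiple of block_kv_compute_in")


def make_attention(block_sizes: BlockSizes) -> dict:
  """Hypothetical factory: takes no separate bkv_compute_in argument."""
  # The inner tiling step is read from block_sizes rather than passed alongside it.
  bkv_compute_in = block_sizes.block_kv_compute_in
  inner_steps = block_sizes.block_kv_compute // bkv_compute_in
  return {"bkv_compute_in": bkv_compute_in, "inner_steps": inner_steps}


config = make_attention(
    BlockSizes(block_q=512, block_kv=1024, block_kv_compute=512, block_kv_compute_in=128)
)
```

With a single source for the value, a caller cannot pass a bkv_compute_in that disagrees with the one stored in block_sizes.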

Impact

- Reduces parameter bloat across the low-level Pallas kernels.
- Ensures API consistency across standard Ulysses attention and Tokamax Ring Attention.

github-actions · 2026-06-29T06:31:02Z

e2e testgrid: https://8bcf50593faf4ea38060e236169827e5-dot-us-central1.composer.googleusercontent.com/dags/maxdiffusion_tpu_e2e/grid

eltsai

Thanks for cleaning this up, @Perseus14! LGTM

Perseus14 · 2026-07-01T18:08:24Z

@eltsai Could you check again? I have also made a minor cleanup to the ring attention implementation.

Perseus14 requested a review from entrpn as a code owner June 29, 2026 06:30

Perseus14 self-assigned this Jun 29, 2026

Perseus14 requested review from csgoogle and eltsai June 29, 2026 06:57

csgoogle previously approved these changes Jun 29, 2026

github-actions Bot added the pull ready label Jun 29, 2026

entrpn reviewed Jun 30, 2026

Comment thread src/maxdiffusion/kernels/custom_splash_attention.py

entrpn previously approved these changes Jun 30, 2026

eltsai previously approved these changes Jun 30, 2026

Perseus14 dismissed stale reviews from eltsai, entrpn, and csgoogle via 632e774 July 1, 2026 18:04

Perseus14 force-pushed the custom_attn_fix branch from fb64673 to 632e774 July 1, 2026 18:04

Perseus14 requested review from csgoogle and eltsai July 1, 2026 18:08

Perseus14 force-pushed the custom_attn_fix branch from 632e774 to f6c2c11 July 1, 2026 18:16

Cleaning up custom_splash_attention

fd56996

Perseus14 force-pushed the custom_attn_fix branch from f6c2c11 to fd56996 July 1, 2026 18:18


Perseus14 wants to merge 1 commit into main from custom_attn_fix
