batch mem2reg to process all variables in a single pass by LegNeato · Pull Request #547 · Rust-GPU/rust-gpu

LegNeato · 2026-03-18T03:01:06Z

mem2reg processed each variable independently.
This O(V*N) complexity caused multi-minute hangs on large post-inline functions.

Batch all operations into single passes over the instructions.

Fixes #546. Measured on the proof-of-space-gpu shader: mem2reg drops from 228s to 18s, total link from 236s to 26s.

Also adds unit tests.

Disclosure: largely AI.

LegNeato · 2026-03-18T03:01:18Z

@eddyb can you review this one?

mem2reg processed each variable independently. This O(V*N) complexity caused multi-minute hangs on large post-inline functions. Batch all operations into single passes over the instructions. Fixes Rust-GPU#546. Measured on the proof-of-space-gpu shader: mem2reg drops from 228s to 18s (13x), total link from 236s to 26s. Also adds unit tests.

Firestar99

Looks sane to me, then again, I never looked at our mem2reg pass before. What's really interesting to me are the access patters of this pass, and how it reads and writes instructions, surely useful info for implementing some more efficient data structures for storing a module.

crates/rustc_codegen_spirv/src/linker/mem2reg.rs

The SPIR-V spec allows OpLine/OpNoLine between OpPhi instructions at the start of a block. Account for this in the phi search so it doesn't stop early at a debug instruction. Adds two tests for the phi search boundary behavior.

Add doc comment to construct_access_chain_info explaining that access chain indices must be scalar integers per the SPIR-V spec, and that the constants map only tracks u32 (matching what rustc emits). Add test for a u64 constant index that is valid SPIR-V but not resolved by mem2reg, verifying the variable is not promoted.

nazar-pc · 2026-03-19T23:53:51Z

As big improvement as it is, 18 seconds for mem2reg still seems way too long in absolute terms

LegNeato · 2026-03-21T18:16:41Z

Yeah, I didn't want to do any major changes and just moved multiple loops to batches. I can take a closer look after we get to a steady state.

LegNeato · 2026-03-22T00:47:06Z

@Firestar99 @eddyb any stamp here?

eddyb · 2026-03-23T05:44:09Z

I'm getting back into catching up with Rust-GPU this week (ideally unblocking a release soon), and I'd want to wait for my own PR (last touched ~5 months ago) that both tracks the progress made on #63 and includes a couple of additional optimizations (mostly doing more mem2reg and other clean-up, during inlining).

More specifically, I would like to see the remaining improvement from this PR (after rebasing it on my branch), on the "worst offenders" shaders we have (my old benchmarks + this newer one) - there's a solid chance there will still be some improvement left, but I've been scared of soundness subtleties in the past.

(The idea behind this PR seems fine, and the future SPIR-T "propagate values of locals" pass does handle all local variables together, instead of one at a time, but I haven't reviewed this yet)

LegNeato requested review from Firestar99 and eddyb as code owners March 18, 2026 03:01

LegNeato force-pushed the batch-mem2reg branch from 2040e14 to 6199f8f Compare March 18, 2026 04:20

LegNeato mentioned this pull request Mar 18, 2026

rustc hangs for a few minutes since recent Rust toolchain upgrade #546

Open

Firestar99 reviewed Mar 18, 2026

View reviewed changes

crates/rustc_codegen_spirv/src/linker/mem2reg.rs Outdated Show resolved Hide resolved

crates/rustc_codegen_spirv/src/linker/mem2reg.rs Show resolved Hide resolved

LegNeato added 2 commits March 18, 2026 11:27

mem2reg: handle OpLine/OpNoLine interleaved with OpPhi

35bf77b

The SPIR-V spec allows OpLine/OpNoLine between OpPhi instructions at the start of a block. Account for this in the phi search so it doesn't stop early at a debug instruction. Adds two tests for the phi search boundary behavior.

LegNeato force-pushed the batch-mem2reg branch from 9ca9144 to 7a008fe Compare March 18, 2026 18:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

batch mem2reg to process all variables in a single pass#547

batch mem2reg to process all variables in a single pass#547
LegNeato wants to merge 3 commits intoRust-GPU:mainfrom
LegNeato:batch-mem2reg

LegNeato commented Mar 18, 2026 •

edited

Loading

Uh oh!

LegNeato commented Mar 18, 2026

Uh oh!

Firestar99 left a comment

Uh oh!

Uh oh!

Uh oh!

nazar-pc commented Mar 19, 2026

Uh oh!

LegNeato commented Mar 21, 2026

Uh oh!

LegNeato commented Mar 22, 2026

Uh oh!

eddyb commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

LegNeato commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LegNeato commented Mar 18, 2026

Uh oh!

Firestar99 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

nazar-pc commented Mar 19, 2026

Uh oh!

LegNeato commented Mar 21, 2026

Uh oh!

LegNeato commented Mar 22, 2026

Uh oh!

eddyb commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

LegNeato commented Mar 18, 2026 •

edited

Loading