
Add loop parallel store optimization and DCE pass#149

Merged
maleadt merged 1 commit into main from tb/layernorm
Mar 28, 2026

Conversation

@maleadt
Member

@maleadt maleadt commented Mar 28, 2026

Implements two compiler passes that clean up unnecessary token overhead from the alias-aware token ordering pass (#89),
matching cuTile Python's output:

  1. Loop parallel store optimization: stores in for-loops whose indices are injective in the induction variable use the parent scope's token instead of a loop-carried token, breaking the token dependency chain through the loop. Matches Python's _try_loop_parallel_store.
  2. Dead code elimination: General-purpose DCE using dependency graph reachability analysis. Removes dead token carries, join_tokens, and unused instructions left behind by the parallel store optimization. Uses Julia's efunc effect annotations to classify intrinsic side effects. Matches Python's dead_code_elimination_pass.
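As an illustration of the condition behind pass 1, an index that is an affine function of the induction variable with a nonzero stride writes a distinct location on every iteration, so the store needs no loop-carried ordering. The sketch below is illustrative only; `AffineIndex`, `is_injective_in_iv`, and `pick_store_token` are hypothetical names, not the PR's actual IR types:

```python
# Hedged sketch of the loop-parallel-store condition. If a store's index
# is injective in the induction variable, iterations write disjoint
# locations, so the store may take the parent scope's token instead of
# the loop-carried one. All names here are illustrative.
from dataclasses import dataclass

@dataclass
class AffineIndex:
    stride: int  # coefficient of the induction variable
    offset: int

def is_injective_in_iv(index: AffineIndex) -> bool:
    # i -> stride*i + offset is injective over the integers iff stride != 0
    return index.stride != 0

def pick_store_token(index: AffineIndex, parent_token: str, loop_token: str) -> str:
    # Disjoint writes need no ordering through the loop: use the parent token.
    return parent_token if is_injective_in_iv(index) else loop_token

print(pick_store_token(AffineIndex(stride=1, offset=0), "tok_root", "tok_carry"))
# tok_root
```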

Together these eliminate all dead token loop carries and join_tokens from memory-bound kernels like layernorm, producing token IR structurally identical to cuTile Python (all ops use the root token, zero loop-carried tokens).
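The reachability-based DCE described in pass 2 can be sketched roughly as follows: seed a worklist with side-effecting instructions, walk the dependency graph backwards, and drop everything unreached. `Instr`, the `effectful` flag, and the example program are stand-ins, not the PR's actual data structures:

```python
# Minimal sketch of dead code elimination via dependency-graph
# reachability: anything not transitively used by a side-effecting
# instruction is dead. Illustrative only.
from dataclasses import dataclass, field

@dataclass
class Instr:
    name: str
    operands: list = field(default_factory=list)  # names of values this uses
    effectful: bool = False                       # e.g. a store

def dce(instrs):
    """Keep only instructions reachable from side-effecting roots."""
    defs = {i.name: i for i in instrs}
    live, work = set(), [i.name for i in instrs if i.effectful]
    while work:
        n = work.pop()
        if n in live or n not in defs:
            continue
        live.add(n)
        work.extend(defs[n].operands)
    return [i for i in instrs if i.name in live]

# Example: a token carry and join_tokens left dead by the
# parallel-store optimization get swept away.
prog = [
    Instr("tok0"),
    Instr("ptr"),
    Instr("val"),
    Instr("store", ["ptr", "val", "tok0"], effectful=True),
    Instr("tok1", ["store"]),          # unused result token -> dead
    Instr("join", ["tok0", "tok1"]),   # unused join_tokens   -> dead
]
print([i.name for i in dce(prog)])
# ['tok0', 'ptr', 'val', 'store']
```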

Closes #146

@maleadt maleadt marked this pull request as ready for review March 28, 2026 09:14
@maleadt
Member Author

maleadt commented Mar 28, 2026

Not perfect yet, but I have a couple of things building on top of this, so let's merge it already.

@maleadt maleadt merged commit ee913f3 into main Mar 28, 2026
9 checks passed
@maleadt maleadt deleted the tb/layernorm branch March 28, 2026 09:15


Development

Successfully merging this pull request may close these issues.

Layernorm regression: Token threading requires loop parallel store optimization
