Commit f5bb5ce
Add Mixture of Experts (MoE) example
Port of cuTile Python's MoE.py sample with two kernels:
- fused_moe_kernel: tiled matmul with gather/scatter for expert routing
- silu_and_mul_kernel: element-wise SiLU activation
The Julia fused_moe_kernel uses opt_level=0 as a workaround for a
tileiras optimizer crash caused by token loop-carries from gather ops.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 7d3d638 commit f5bb5ce
3 files changed
+799
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
| 4 | + | |
4 | 5 | | |
5 | 6 | | |
6 | 7 | | |
| |||
0 commit comments