Commit 54401a5
committed
Update on "Refactor
Summary:
This is to prefer the addition of flashinfer quantize kernel path in next PR
Test Plan:
python test/prototype/mx_formats/test_inference_workflow.py
Reviewers:
Subscribers:
Tasks:
Tags:
[ghstack-poisoned]use_triton_kernel to use nvfp4_quantize_kernel_choice"2 files changed
Lines changed: 2 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
188 | 188 | | |
189 | 189 | | |
190 | 190 | | |
191 | | - | |
| 191 | + | |
192 | 192 | | |
193 | 193 | | |
194 | 194 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | | - | |
3 | | - | |
4 | | - | |
5 | 2 | | |
| 3 | + | |
6 | 4 | | |
7 | 5 | | |
8 | 6 | | |
| |||
0 commit comments