Commit 8696dc1
Update base for Update on "Refactor use_triton_kernel to use nvfp4_quantize_kernel_choice"
Summary:
This is to prepare for the addition of the flashinfer quantize kernel path in the next PR.
Test Plan:
python test/prototype/mx_formats/test_inference_workflow.py
Reviewers:
Subscribers:
Tasks:
Tags:
[ghstack-poisoned]
1 file changed
Lines changed: 6 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 3093-3095 | 3093-3095 | |
| | 3096-3098 | + |
| 3096-3102 | 3099-3105 | |
| | 3106-3108 | + |
| 3103-3105 | 3109-3111 | |