Rename Sparse2x4CUTLASSFloat8Tensor to Float8Sparse2x4_2DData2DMetadataTensor (#4343)#4343
Rename Sparse2x4CUTLASSFloat8Tensor to Float8Sparse2x4_2DData2DMetadataTensor (#4343)#4343bbeckca wants to merge 2 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4343
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEVsThere are 1 currently active SEVs. If your PR is affected, please view them below: ❌ 2 New FailuresAs of commit 92e86c5 with merge base 6529fca ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
There was a problem hiding this comment.
can we just remove this file?
There was a problem hiding this comment.
or print a warning here and direct to then new path
There was a problem hiding this comment.
Sounds good. Will redirect using an alias and delete this file.
There was a problem hiding this comment.
is this an error? why do we have 1d_data_1d_metadata tensor here
|
|
||
| This packing format will dispatch to torch._cslt_sparse_mm for matmul, with per-tensor scaling passed as alpha. | ||
| """ | ||
| SPARSE_1D_DATA_1D_METADATA = "sparse_1d_data_1d_metadata" |
There was a problem hiding this comment.
I thought this is added in a separate PR?
There was a problem hiding this comment.
oh it's probably because of the PR is exported from stacked diffs
There was a problem hiding this comment.
I think you're right. Will publish from OSS next time to avoid this.
39bd83f to
b1bcda2
Compare
b1bcda2 to
bcd204e
Compare
|
@bbeckca has exported this pull request. If you are a Meta employee, you can view the originating Diff in D102374347. |
|
@pytorchbot label "module: not user facing" |
…taTensor (pytorch#4343) Summary: Rename the CUTLASS float8 sparse tensor class to describe the memory layout: - Class: Sparse2x4CUTLASSFloat8Tensor → Float8Sparse2x4_2DData2DMetadataTensor - Enum: SPARSE_CUTLASS → SPARSE_2D_DATA_2D_METADATA (old value kept for backward compatibility) The old identifiers to Sparse2x4CUTLASSFloat8Tensor will remain importable using backward compatible aliases. Reviewed By: RandySheriff Differential Revision: D102374347
bcd204e to
e452c2c
Compare
…taTensor (pytorch#4343) Summary: Rename the CUTLASS float8 sparse tensor class to describe the memory layout: - Class: Sparse2x4CUTLASSFloat8Tensor → Float8Sparse2x4_2DData2DMetadataTensor - Enum: SPARSE_CUTLASS → SPARSE_2D_DATA_2D_METADATA (old value kept for backward compatibility) The old identifiers to Sparse2x4CUTLASSFloat8Tensor will remain importable using backward compatible aliases. Reviewed By: RandySheriff Differential Revision: D102374347
e452c2c to
f0135d7
Compare
Summary: X-link: pytorch/pytorch#180312 What: Adding a new tensor subclass for FP8 2:4 sparsity via hipSPARSELt (ROCm only). Packs compressed values + metadata into a single tensor with `_cslt_compress` and dispatches through `_cslt_sparse_mm` with `A_scale * B_scale` as `alpha`. Why: This hipSPARSELt path differs enough in packing and kernel routing from CUTLASS to warrant a dedicated path. Reference: https://rocm.blogs.amd.com/artificial-intelligence/introduce_hipsparselt/README.html Reviewed By: RandySheriff Differential Revision: D100640267
…taTensor (pytorch#4343) Summary: Rename the CUTLASS float8 sparse tensor class to describe the memory layout: - Class: Sparse2x4CUTLASSFloat8Tensor → Float8Sparse2x4_2DData2DMetadataTensor - Enum: SPARSE_CUTLASS → SPARSE_2D_DATA_2D_METADATA (old value kept for backward compatibility) The old identifiers to Sparse2x4CUTLASSFloat8Tensor will remain importable using backward compatible aliases. Reviewed By: RandySheriff Differential Revision: D102374347
f0135d7 to
92e86c5
Compare
Summary:
Rename the CUTLASS float8 sparse tensor class to describe the memory layout:
The old identifiers to Sparse2x4CUTLASSFloat8Tensor will remain importable using backward compatible aliases.
Reviewed By: RandySheriff
Differential Revision: D102374347