-
Notifications
You must be signed in to change notification settings - Fork 501
Add mixed granularity tests for smooth quant #3847
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from 7 commits
5183287
2fcda9b
0196250
546e05f
55c56d0
6a4c73e
501e4c5
2d237ad
85c52e1
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||
|---|---|---|---|---|
|
|
@@ -88,8 +88,17 @@ def setUpClass(cls): | |||
| "base_config", | ||||
| [ | ||||
| Int8DynamicActivationInt8WeightConfig(version=2), | ||||
| Int8StaticActivationInt8WeightConfig(granularity=PerRow()), | ||||
| Int8DynamicActivationInt8WeightConfig(version=2, granularity=PerTensor()), | ||||
| Int8DynamicActivationInt8WeightConfig( | ||||
| version=2, granularity=[PerRow(), PerTensor()] | ||||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. actually why is this a list? I saw type is changed to ao/torchao/quantization/quant_api.py Line 856 in 9058b58
I feel we should not have both tuple and list, can you have a followup PR to change this to
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Thanks for comments, updated the UT.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. oh I see, in that case we should remove tuple and just use List, I've seen the json error before as well |
||||
| ), | ||||
| Int8DynamicActivationInt8WeightConfig( | ||||
| version=2, granularity=[PerTensor(), PerRow()] | ||||
| ), | ||||
| Int8StaticActivationInt8WeightConfig(), | ||||
| Int8StaticActivationInt8WeightConfig(granularity=PerTensor()), | ||||
| Int8StaticActivationInt8WeightConfig(granularity=[PerRow(), PerTensor()]), | ||||
| Int8StaticActivationInt8WeightConfig(granularity=[PerTensor(), PerRow()]), | ||||
| # Note: float8_static_activation_float8_weight is broken after recent PyTorch update. | ||||
| # TODO(#1639): Fix for supporting more API in torchao/quantization/quant_api.py | ||||
| ], | ||||
|
|
||||
Uh oh!
There was an error while loading. Please reload this page.