neural_compressor.torch.algorithms.weight_only.modules
torch.nn.Module class definitions.
Module Contents
Classes
Fake version of affine quantization.
Wrapper quantization linear.
Linear wrapper to apply scale to input.
- class neural_compressor.torch.algorithms.weight_only.modules.FakeAffineTensorQuantFunction
Fake version of affine quantization.
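"Fake" quantization usually means rounding values onto an integer grid and immediately mapping them back to floating point, so the result carries the quantization error while staying in float. The sketch below shows only that arithmetic in plain Python. It is not the class's implementation, which works on torch tensors. The function name `fake_affine_quantize`, the `num_bits` parameter, and the per-tensor min/max range are assumptions made for this example.

```python
def fake_affine_quantize(values, num_bits=4):
    """Quantize then dequantize a list of floats (hypothetical helper).

    Assumes asymmetric, per-tensor quantization with a min/max range;
    the real class may choose ranges and rounding differently.
    """
    qmin, qmax = 0, 2 ** num_bits - 1

    # Extend the range to include zero so that 0.0 is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)

    # Step size of the integer grid; fall back to 1.0 for an all-zero input.
    scale = (hi - lo) / (qmax - qmin) or 1.0

    # Integer that represents the real value 0.0, clamped to the grid.
    zero_point = max(qmin, min(qmax, round(-lo / scale)))

    result = []
    for v in values:
        # Quantize: scale, round, shift by the zero point, clamp to [qmin, qmax].
        q = max(qmin, min(qmax, round(v / scale) + zero_point))
        # Dequantize: map the integer back to a float on the same grid.
        result.append((q - zero_point) * scale)
    return result


original = [-1.0, 0.0, 0.5, 2.0]
approximated = fake_affine_quantize(original, num_bits=4)
```

With 4 bits the grid has 16 levels, so each value in `approximated` lies within half a grid step of the corresponding value in `original`. In training code this forward pass is commonly paired with a straight-through gradient, though the source does not say whether this class does so.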