neural_compressor.compression.pruner.patterns.base
Pruning patterns.
Classes
Pruning Pattern. |
|
Pruning Pattern. |
|
Pruning Pattern. |
Functions
|
Class decorator used to register a Pattern subclass to the registry. |
Module Contents
- neural_compressor.compression.pruner.patterns.base.register_pattern(name)[source]
Class decorator used to register a Pattern subclass to the registry.
Decorator function used before a Pattern subclasses. Make sure that this Pattern class can be registered in PATTERNS.
- Parameters:
name – A string defining the pattern type name to be used in a pruning process.
- Returns:
The class of register.
- Return type:
cls
- class neural_compressor.compression.pruner.patterns.base.BasePattern(config, modules)[source]
Pruning Pattern.
It defines the basic pruning unit and how this unit will be pruned during pruning, e.g. 4x1, 2:4.
- Parameters:
config – A config dict object that contains the pattern information.
modules – neural network modules to be pruned with the pattern.
- is_global[source]
A bool determining whether the pruning takes global pruning option. Global pruning means that pruning scores by a pruning criterion are evaluated in all layers. Local pruning, by contrast, means that pruning scores by the pruning criterion are evaluated
in every layer individually.
- max_sparsity_ratio_per_op[source]
A float representing the maximum sparsity that one layer could reach.
- class neural_compressor.compression.pruner.patterns.base.PytorchBasePattern(config, modules)[source]
Pruning Pattern.
It defines the basic pruning unit and how this unit will be pruned during pruning, e.g. 4x1, 2:4.
- Parameters:
config – A config dict object that contains the pattern information.
modules – neural network modules to be pruned with the pattern.
- is_global[source]
A bool determining whether the pruning takes global pruning option. Global pruning means that pruning scores by a pruning criterion are evaluated in all layers. Local pruning, by contrast, means that pruning scores by the pruning criterion are evaluated
in every layer individually.
- max_sparsity_ratio_per_op[source]
A float representing the maximum sparsity that one layer could reach.
- class neural_compressor.compression.pruner.patterns.base.KerasBasePattern(config, modules)[source]
Pruning Pattern.
It defines the basic pruning unit and how this unit will be pruned during pruning, e.g. 4x1, 2:4.
- Parameters:
config – A config dict object that contains the pattern information.
modules – neural network modules to be pruned with the pattern.
- is_global[source]
A bool determining whether the pruning takes global pruning option. Global pruning means that pruning scores by a pruning criterion are evaluated in all layers. Local pruning, by contrast, means that pruning scores by the pruning criterion are evaluated
in every layer individually.
- max_sparsity_ratio_per_op[source]
A float representing the maximum sparsity that one layer could reach.