neural_compressor.compression.pruner.patterns.ninm

N:M patterns.

Module Contents

Classes

PytorchPatternNInM

Pruning Pattern.

class neural_compressor.compression.pruner.patterns.ninm.PytorchPatternNInM(config, modules)

Pruning Pattern.

A pattern class derived from Pattern. In this pattern, N out of every M contiguous weights are pruned. For more information about this pattern, see https://github.com/intel/neural-compressor/blob/master/docs/sparsity.md

Parameters:

config – A config dict object that contains the pattern information.
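
As an illustration of how N and M might be obtained from such a config, the sketch below parses an "N:M" string. The key name "pattern", the string form "2:4", and the helper parse_nm are assumptions made for this example, not part of the documented API; check the library source for the actual config layout.

```python
def parse_nm(config):
    """Extract (N, M) from a config dict.

    Assumption: the pattern is stored as an "N:M" string under the
    hypothetical key "pattern". This is an illustrative helper only.
    """
    n_str, m_str = config["pattern"].split(":")
    n, m = int(n_str), int(m_str)
    # N weights are pruned out of every M, so N must be smaller than M.
    if not 0 < n < m:
        raise ValueError(f"expected 0 < N < M, got N={n}, M={m}")
    return n, m


config = {"pattern": "2:4"}  # hypothetical config contents
n, m = parse_nm(config)
```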

N

The number of elements to be pruned in a weight sequence.

M

The size of the weight sequence.
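
To make the roles of N and M concrete, here is a minimal pure-Python sketch of N:M pruning on a flat list of weights. It assumes a magnitude criterion (the N smallest absolute values in each group of M are zeroed); the function nm_prune is hypothetical, and the scoring and tensor layout used by PytorchPatternNInM may differ.

```python
def nm_prune(weights, n, m):
    """Zero n weights in every group of m contiguous weights.

    Assumption: the n weights with the smallest magnitude in each
    group are the ones pruned. Illustrative only.
    """
    if not 0 <= n <= m:
        raise ValueError("n must satisfy 0 <= n <= m")
    if len(weights) % m != 0:
        raise ValueError("number of weights must be a multiple of m")

    pruned = list(weights)
    for start in range(0, len(weights), m):
        group = range(start, start + m)
        # Sort the group's indices by ascending magnitude and take the first n.
        drop = sorted(group, key=lambda i: abs(weights[i]))[:n]
        for i in drop:
            pruned[i] = 0.0
    return pruned


weights = [0.9, -0.1, 0.4, 0.05, -0.7, 0.2, 0.3, -0.6]
pruned = nm_prune(weights, n=2, m=4)
# pruned == [0.9, 0.0, 0.4, 0.0, -0.7, 0.0, 0.0, -0.6]
```

With N = 2 and M = 4, each group of four contiguous weights keeps exactly two nonzero values, so the resulting sparsity is N / M = 50%.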