neural_compressor.utils.pytorch
Pytorch utilities.
Functions
|
Check whether the input model is a int8 model. |
|
Load model in weight_only mode. |
|
Execute the quantize process on the specified model. |
|
Recover ipex model from JSON file. |
Module Contents
- neural_compressor.utils.pytorch.is_int8_model(model)[source]
Check whether the input model is a int8 model.
- Parameters:
model (torch.nn.Module) – input model
- Returns:
Return True if the input model is a int8 model.
- Return type:
result(bool)
- neural_compressor.utils.pytorch.load_weight_only(checkpoint_dir, model, layer_wise=False)[source]
Load model in weight_only mode.
- Parameters:
checkpoint_dir (dir/file/dict) – The folder of checkpoint. ‘qconfig.json’ and ‘best_model.pt’ are needed in This directory. ‘checkpoint’ dir is under workspace folder and workspace folder is define in configure yaml file.
model (object) – fp32 model need to do quantization.
- Returns:
quantized model
- Return type:
(object)
- neural_compressor.utils.pytorch.load(checkpoint_dir=None, model=None, layer_wise=False, history_cfg=None, **kwargs)[source]
Execute the quantize process on the specified model.
- Parameters:
checkpoint_dir (dir/file/dict) – The folder of checkpoint. ‘best_configure.yaml’ and ‘best_model_weights.pt’ are needed in This directory. ‘checkpoint’ dir is under workspace folder and workspace folder is define in configure yaml file.
model (object) – fp32 model need to do quantization.
history_cfg (object) – configurations from history.snapshot file.
**kwargs (dict) – contains customer config dict and etc.
- Returns:
quantized model
- Return type:
(object)
- neural_compressor.utils.pytorch.recover_model_from_json(model, json_file_path, example_inputs)[source]
Recover ipex model from JSON file.
- Parameters:
model (object) – fp32 model need to do quantization.
json_file_path (json) – configuration JSON file for ipex.
example_inputs (tuple or torch.Tensor or dict) – example inputs that will be passed to the ipex function.
- Returns:
quantized model
- Return type:
(object)