neural_compressor.tensorflow.quantization.quantize
Module Contents
Functions
|
The main entry to quantize model. |
|
Quantize model using single config. |
- neural_compressor.tensorflow.quantization.quantize.quantize_model(model: str | tensorflow.keras.Model | neural_compressor.tensorflow.utils.BaseModel, quant_config: neural_compressor.common.base_config.BaseConfig | list, calib_dataloader: Callable = None, calib_iteration: int = 100)[source]
The main entry to quantize model.
- Parameters:
model – a fp32 model to be quantized.
quant_config – single or lists of quantization configuration.
calib_dataloader – a data loader for calibration.
calib_iteration – the iteration of calibration.
- Returns:
the quantized model.
- Return type:
q_model
- neural_compressor.tensorflow.quantization.quantize.quantize_model_with_single_config(q_model: neural_compressor.tensorflow.utils.BaseModel, quant_config: neural_compressor.common.base_config.BaseConfig, calib_dataloader: Callable = None, calib_iteration: int = 100)[source]
Quantize model using single config.
- Parameters:
model – a model wrapped by INC TF model class.
quant_config – a quantization configuration.
calib_dataloader – a data loader for calibration.
calib_iteration – the iteration of calibration.
- Returns:
the quantized model.
- Return type:
q_model