neural_compressor.jax.algorithms.static

Static quantization algorithm entry point for JAX models.

Functions

static_quantize(→ keras.Model)

Quantize model using Static quantization algorithm.

Module Contents

neural_compressor.jax.algorithms.static.static_quantize(model: keras.Model, configs_mapping: OrderedDict[str | str, OrderedDict[str, neural_compressor.common.base_config.BaseConfig]] | None = None, quant_config: neural_compressor.common.base_config.BaseConfig | None = None, calib_function: Callable | None = None) keras.Model[source]

Quantize model using Static quantization algorithm.

Parameters:
  • model (keras.Model) – JAX model to be quantized.

  • configs_mapping (Optional[OrderedDict[Union[str, str], OrderedDict[str, BaseConfig]]]) – Mapping of configurations for the algorithm.

  • quant_config (Optional[BaseConfig]) – Quantization configuration for wrapper selection.

  • calib_function (Optional[Callable]) – Calibration function used to collect activation statistics.

Returns:

The quantized model wrapped for inference.

Return type:

keras.Model