neural_compressor.jax.quantization.layers_dynamic
Dynamic quantized layer implementations for JAX-backed Keras models.
Classes
Layer that applies dynamic quantize-dequantize to activations. |
|
Mixin that adds dynamic quantization to dense-like layers. |
|
Dynamically quantized Dense layer. |
|
Dynamically quantized EinsumDense layer. |
|
Dynamically quantized MultiHeadAttention layer. |
|
Dynamically quantized CachedGemma3Attention layer. |
|
Dynamically quantized Gemma3VisionAttention layer. |
|
Dynamically quantized ReversibleEmbedding layer. |
Functions
Register quantized layer class for an original layer class. |
Module Contents
- neural_compressor.jax.quantization.layers_dynamic.register_dynamic_quantized_layer(clso)[source]
Register quantized layer class for an original layer class.
- Parameters:
clso (type) – Original layer class to map to a quantized implementation.
- Returns:
Decorator that registers the quantized class.
- Return type:
Callable
- class neural_compressor.jax.quantization.layers_dynamic.DynamicQDQLayer(name, activation_dtype, asymmetric=False)[source]
Layer that applies dynamic quantize-dequantize to activations.
- class neural_compressor.jax.quantization.layers_dynamic.QDynamicDenseMixin[source]
Mixin that adds dynamic quantization to dense-like layers.
- class neural_compressor.jax.quantization.layers_dynamic.QDynamicDense[source]
Dynamically quantized Dense layer.
- class neural_compressor.jax.quantization.layers_dynamic.QDynamicEinsumDense[source]
Dynamically quantized EinsumDense layer.
- class neural_compressor.jax.quantization.layers_dynamic.QDynamicMultiHeadAttention[source]
Dynamically quantized MultiHeadAttention layer.
- class neural_compressor.jax.quantization.layers_dynamic.QDynamicCachedGemma3Attention[source]
Dynamically quantized CachedGemma3Attention layer.