neural_compressor.adaptor.tf_utils.graph_rewriter.int8.fuse_matmul_requantize
Fuse QuantizedMatMul with Requantize/Dequantize Graph Rewriter.
Module Contents
Classes
Fuse QuantizedMatMul + Requantize + Dequantize into QuantizedMatMulWithBiasAndDequantize. |
|
Fuse Quantized MatMul Op with the successor Requantize Op. |
|
Fuse _QuantizedMatMul + Requantize + Dequantize into _QuantizedMatMul. |
|
Fuse newAPI Quantized MatMul Op with the successor Requantize Op. |
- class neural_compressor.adaptor.tf_utils.graph_rewriter.int8.fuse_matmul_requantize.FuseMatMulRequantizeDequantizeTransformer(model, device='cpu')[source]
Fuse QuantizedMatMul + Requantize + Dequantize into QuantizedMatMulWithBiasAndDequantize.
- class neural_compressor.adaptor.tf_utils.graph_rewriter.int8.fuse_matmul_requantize.FuseMatMulRequantizeTransformer(model, device='cpu')[source]
Fuse Quantized MatMul Op with the successor Requantize Op.