neural_compressor.tensorflow.quantization.utils.graph_rewriter.qdq.insert_qdq_pattern
Insert QDQ pattern Graph Rewriter.
Module Contents
Classes
Insert Q/DQ pairs before quantizable ops. |
- class neural_compressor.tensorflow.quantization.utils.graph_rewriter.qdq.insert_qdq_pattern.GenerateGraphWithQDQPattern(model, calibration_data, op_wise_config, fake_quant, fp32_ops, bf16_ops, quantized_nodes, device, performance_only, itex_mode, llm_weight_minmax)[source]
Insert Q/DQ pairs before quantizable ops.