# Copyright (c) 2022 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import collections
import logging
import numpy as np
try:
from tqdm import tqdm
except ImportError:
from .utils import tqdm
import paddle
from ...base.framework import IrGraph, IrNode
from ...framework import _get_paddle_place, core
from ...static import Program, data, program_guard, scope_guard
from ...utils import unique_name
from ..log_helper import get_logger
from . import utils
from .quant_config import (
SUPPORT_ACT_QUANTIZATION_OP_DICT,
SUPPORT_QUANTIZATION_OP_DICT,
SUPPORT_WEIGHT_QUANTIZATION_OP_DICT,
)
_logger = get_logger(
__name__, logging.INFO, fmt='%(asctime)s-%(levelname)s: %(message)s'
)
_fake_quant_op_list = [
'fake_quantize_abs_max',
'fake_quantize_range_abs_max',
'fake_quantize_moving_average_abs_max',
'fake_channel_wise_quantize_abs_max',
]
_fake_dequant_op_list = [
'fake_dequantize_max_abs',
'fake_channel_wise_dequantize_max_abs',
]
_fake_quant_dequant_op_list = [
'fake_quantize_dequantize_moving_average_abs_max',
"fake_channel_wise_quantize_dequantize_abs_max",
"fake_quantize_dequantize_abs_max",
]
_conv_ops = ['conv2d', 'depthwise_conv2d', 'conv2d_transpose']
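# Initial value for scale variables when no calibrated scale exists in the scope yet.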
_SCALE_DEFAULT_VALUE = 0.001
def _init_var_node(var_node, value, scope, place):
assert isinstance(value, np.ndarray), (
'The type of value should be a numpy array.'
)
assert scope is not None, 'The scope cannot be None.'
assert place is not None, 'The place cannot be None.'
tensor = scope.var(var_node.name()).get_tensor()
tensor.set(value, place)
def _is_input_all_not_persistable(graph, op_node):
'''
Analyse whether the real inputs of the op node are all non-persistable.
'''
is_input_all_not_persistable = True
for var_name in utils._get_op_input_var_names(op_node):
in_node = graph._find_node_by_name(op_node.inputs, var_name)
is_input_all_not_persistable = is_input_all_not_persistable and (
not in_node.persistable()
)
return is_input_all_not_persistable
class QuantizationTransformPass:
"""
Quantize the ops that have weights. Add quant and dequant ops for
the quantized ops' inputs.
"""
def __init__(
self,
scope=None,
place=None,
weight_bits=8,
activation_bits=8,
activation_quantize_type='abs_max',
weight_quantize_type='abs_max',
window_size=10000,
moving_rate=0.9,
skip_pattern=['skip_quant'],
quantizable_op_type=['conv2d', 'depthwise_conv2d', 'mul'],
weight_quantize_func=None,
act_quantize_func=None,
weight_preprocess_func=None,
act_preprocess_func=None,
optimizer_func=None,
executor=None,
is_test=None,
):
r"""
Constructor.
Args:
scope(static.Scope): When the activation uses 'range_abs_max' as the quantize
type, this pass creates some new parameters. The scope is used to
initialize these new parameters.
place(static.CPUPlace|static.CUDAPlace|str): place is used to initialize the new
parameters described above. If it is a string, it can be ``cpu`` or ``gpu:x``,
where ``x`` is the index of the GPUs.
weight_bits(int): quantization bit number for weights;
the bias is not quantized.
activation_bits(int): quantization bit number for activation.
activation_quantize_type(str): quantization type for activation;
currently supports 'abs_max', 'range_abs_max' and 'moving_average_abs_max'.
In 'abs_max' mode, the quantization scale is calculated
dynamically at each step in both the training and testing period. In
'range_abs_max' mode, a static quantization scale is calculated
during training and used for inference.
weight_quantize_type(str): quantization type for weights;
supports 'abs_max' and 'channel_wise_abs_max'. 'range_abs_max'
is usually not used for weights, since weights are fixed once the
model is well trained.
window_size(int): the window size for 'range_abs_max' quantization.
moving_rate(float): the param for 'moving_average_abs_max' quantization.
skip_pattern(str or str list): The user-defined quantization skip pattern, which
will be presented in the name scope of an op. When the skip pattern is
detected in an op's name scope, the corresponding op will not be quantized.
quantizable_op_type(list[str]): List the type of ops that will be quantized.
Default is ["conv2d", "depthwise_conv2d", "mul"]. The quantizable_op_type in
QuantizationFreezePass and ConvertToInt8Pass must be the same as this.
weight_quantize_func(function): Function that defines how to quantize the weight.
This makes it easy to test whether a user-defined quantization method works.
The function should define both the quantization and the dequantization
process; that is, its input is the non-quantized weight and it returns
the dequantized weight. If None, the quantization op defined by
'weight_quantize_type' is used. Default is None.
act_quantize_func(function): Function that defines how to quantize the activation.
This makes it easy to test whether a user-defined quantization method works.
The function should define both the quantization and the dequantization
process; that is, its input is the non-quantized activation and it returns
the dequantized activation. If None, the quantization op defined by
'activation_quantize_type' is used. Default is None.
weight_preprocess_func(function): Function that defines how to preprocess the
weight before quantization. This makes it easy to test whether a user-defined
preprocess method works. The function's input is the non-quantized weight and
it returns the processed weight to be quantized. If None, the weight is
quantized directly. Default is None.
act_preprocess_func(function): Function that defines how to preprocess the
activation before quantization. This makes it easy to test whether a
user-defined preprocess method works. The function's input is the non-quantized
activation and it returns the processed activation to be quantized.
If None, the activation is quantized directly. Default is None.
optimizer_func(function): Function that returns an optimizer. When 'is_test' is
False and the user wants to use a self-defined quantization function or
preprocess function, this must be set. Default is None.
executor(base.Executor): If the user wants to use a self-defined quantization
function or preprocess function, an executor must be set for initialization.
Default is None.
is_test(bool, optional): Whether the graph is a test graph. If None, it is
inferred from the graph. Default is None.
Examples:
.. code-block:: pycon
>>> # The original graph will be rewritten.
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import QuantizationTransformPass
>>> from paddle.base.framework import IrGraph
>>> from paddle.framework import core
>>> graph = IrGraph(core.Graph(static.Program().desc), for_test=False)
>>> place = paddle.CPUPlace()
>>> transform_pass = QuantizationTransformPass(static.global_scope(), place)
>>> transform_pass.apply(graph)
"""
self._scope = scope
self._place = _get_paddle_place(place)
self._weight_bits = weight_bits
self._activation_bits = activation_bits
self._skip_pattern = skip_pattern
self._weight_quantize_func = weight_quantize_func
self._act_quantize_func = act_quantize_func
self._weight_preprocess_func = weight_preprocess_func
self._act_preprocess_func = act_preprocess_func
self._optimizer = optimizer_func
self._exe = executor
quant_type = [
'abs_max',
'channel_wise_abs_max',
'range_abs_max',
'moving_average_abs_max',
]
assert activation_quantize_type != 'channel_wise_abs_max', (
"The activation quantization type does not support 'channel_wise_abs_max'."
)
if activation_quantize_type not in quant_type:
raise ValueError(
f"Unknown activation_quantize_type : '{activation_quantize_type}'. It can only be "
"'abs_max' or 'range_abs_max' or 'moving_average_abs_max'."
)
if weight_quantize_type not in quant_type:
raise ValueError(
f"Unknown weight_quantize_type: '{weight_quantize_type}'. It can only be "
"'abs_max' or 'channel_wise_abs_max' or 'range_abs_max' "
"or 'moving_average_abs_max'."
)
self._activation_quantize_type = activation_quantize_type
self._weight_quantize_type = weight_quantize_type
self._window_size = window_size
self._moving_rate = moving_rate
self._quantizable_ops = quantizable_op_type
for op in self._quantizable_ops:
assert op in list(SUPPORT_WEIGHT_QUANTIZATION_OP_DICT.keys()), (
op + " is not supported for quantization."
)
self._quantizable_grad_ops = [
f'{op}_grad' for op in self._quantizable_ops
]
self._is_test = is_test
self._global_step = None
self.create_var_map = {}
self.create_op_map = {}
def apply(self, graph):
"""
Quantize the graph for the training process. According to the weight and
activation quantization type, fake quantize and fake dequantize
operators will be inserted into the graph.
Args:
graph(IrGraph): the applied graph.
Returns:
None
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
if self._is_test is None:
self._is_test = graph.is_test()
# Record the variables that have been dequantized.
dequantized_vars = collections.OrderedDict()
persistable_vars = [p.name() for p in graph.all_persistable_nodes()]
processed_vars = []
def _quant_preprocess(op_node):
user_skipped = False
if isinstance(self._skip_pattern, list):
user_skipped = op_node.op().has_attr("op_namescope") and any(
pattern in op_node.op().attr("op_namescope")
for pattern in self._skip_pattern
)
elif isinstance(self._skip_pattern, str):
user_skipped = (
op_node.op().has_attr("op_namescope")
and op_node.op()
.attr("op_namescope")
.find(self._skip_pattern)
!= -1
)
if user_skipped:
op_node.op()._set_attr("skip_quant", True)
op_node.op()._set_attr("with_quant_attr", True)
def _transform_forward(graph, op):
op.op()._set_attr("quantization_type", "qat_with_weight")
op.op()._set_attr("with_quant_attr", True)
op_role = op.op().attr("op_role")
inputs = op.inputs
for var_node in inputs:
if var_node.name() not in op.input_arg_names():
continue
if var_node.name() in dequantized_vars:
dequant_var_node = dequantized_vars[var_node.name()]
else:
name = var_node.name()
if name in processed_vars:
continue
is_weight = var_node.name() in persistable_vars
# If the var node is a weight and weight_preprocess_func is not None,
# insert the weight preprocess func to preprocess the weight before
# quantization; if the var node is an activation and act_preprocess_func
# is not None, insert the activation preprocess func likewise.
if is_weight and self._weight_preprocess_func is not None:
var_node = self._insert_func(
graph, self._weight_preprocess_func, var_node, op
)
elif (
not is_weight and self._act_preprocess_func is not None
):
var_node = self._insert_func(
graph, self._act_preprocess_func, var_node, op
)
# If the var node is a weight and weight_quantize_func is not None,
# insert the weight quantize func to quantize and dequantize the weight;
# if the var node is an activation and act_quantize_func is not None,
# insert the act quantize func to quantize and dequantize the activation.
if is_weight and self._weight_quantize_func is not None:
target_out_node = self._insert_func(
graph, self._weight_quantize_func, var_node, op
)
processed_vars.append(name)
continue
elif not is_weight and self._act_quantize_func is not None:
target_out_node = self._insert_func(
graph, self._act_quantize_func, var_node, op
)
processed_vars.append(name)
continue
quant_bits = (
self._weight_bits
if var_node.name() in persistable_vars
else self._activation_bits
)
quant_type = (
self._weight_quantize_type
if is_weight
else self._activation_quantize_type
)
if (
quant_type == 'channel_wise_abs_max'
): # Weight quantization
op_type = op.name()
trans_y = (op_type == 'matmul_v2') and op.op().attr(
'trans_y'
)
op_type = op_type + '_trans_y' if trans_y else op_type
quant_axis = (
1
if op_type in utils._channelwise_quant_axis1_ops
else 0
)
(
quant_var_node,
scale_var_node,
) = self._insert_channel_quant_op(
graph,
var_node,
name,
quant_bits,
quant_axis,
op_role,
)
dequant_var_node = self._insert_channel_dequant_op(
graph,
quant_var_node,
[scale_var_node],
[quant_bits],
quant_axis,
op_role,
)
else:
quant_var_node, scale_var_node = self._insert_quant_op(
graph,
var_node,
name,
quant_bits,
quant_type,
op_role,
)
dequant_var_node = self._insert_dequant_op(
graph,
quant_var_node,
scale_var_node,
quant_bits,
op_role,
)
dequantized_vars[name] = dequant_var_node
graph.update_input_link(var_node, dequant_var_node, op)
def _transform_backward(graph, op):
for var_node in op.inputs:
if var_node.name() not in op.input_arg_names():
continue
if var_node.name() in dequantized_vars:
dequant_var_node = dequantized_vars[var_node.name()]
graph.update_input_link(var_node, dequant_var_node, op)
def _has_weight(op):
has_weight = False
for var_node in op.inputs:
if var_node.name() not in op.input_arg_names():
continue
if var_node.name() in persistable_vars:
has_weight = True
return has_weight
if not self._is_test:
self._create_global_step(graph)
ops = graph.all_op_nodes()
# Do the preprocess for quantization, such as marking the ops that
# should be skipped and not quantized.
for op in ops:
if (
op.name() in self._quantizable_ops
or op.name() in self._quantizable_grad_ops
):
_quant_preprocess(op)
# Insert a mapping table to solve the problem of saving the inference model.
graph.out_node_mapping_table = {}
# _transform_forward and _transform_backward must run in two separate loops.
# The loop for transforming the forward graph:
with tqdm(
total=len(ops),
bar_format='Adding quant op with weight:|{bar}| {n_fmt}/{total_fmt}',
ncols=80,
) as t:
for op in ops:
if op.name() in self._quantizable_ops:
if not self._is_skip_quant(graph, op) and _has_weight(op):
_transform_forward(graph, op)
t.update()
# The loop for renaming the inputs of backward op.
for op in ops:
if op.name() in self._quantizable_grad_ops and _has_weight(op):
_transform_backward(graph, op)
graph.resolve_hazard()
return graph
def _create_global_step(self, graph):
if (
self._weight_quantize_type == 'range_abs_max'
or self._activation_quantize_type == 'range_abs_max'
):
counter_name = '@STEP_COUNTER@'
for node in graph.all_var_nodes():
if node.name() == counter_name:
self._global_step = node
if self._global_step is None:
global_step_in = graph.create_persistable_node(
name=counter_name,
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[1],
var_dtype=core.VarDesc.VarType.INT64,
)
_init_var_node(
global_step_in,
np.zeros([1], dtype='int64'),
self._scope,
self._place,
)
global_step_out = graph.create_var_node_from_desc(
global_step_in.var()
)
# The attribute of `op_role` is needed by ParallelExecutor.
increment_op = graph.create_op_node(
op_type='increment',
attrs={
'step': 1.0,
'op_role': core.op_proto_and_checker_maker.OpRole.Forward,
},
inputs={'X': global_step_in},
outputs={'Out': global_step_out},
)
graph.link_to(global_step_in, increment_op)
graph.link_to(increment_op, global_step_out)
self._global_step = global_step_out
def _insert_quant_op(
self, graph, var_node, name, quant_bits, quant_type, op_role
):
"""
Insert fake_quantize_op in the graph.
"""
if quant_type == 'abs_max':
return self._insert_quant_abs_max_op(
graph, var_node, name, quant_bits, op_role
)
elif quant_type == 'range_abs_max':
return self._insert_quant_range_abs_max_op(
graph, var_node, name, quant_bits, op_role
)
elif quant_type == 'moving_average_abs_max':
return self._insert_quant_moving_average_abs_max_op(
graph, var_node, name, quant_bits, op_role
)
def _insert_quant_abs_max_op(
self, graph, var_node, name, quant_bits, op_role
):
"""
Insert fake_quantize_abs_max op in the graph.
"""
assert var_node.is_var(), f'{var_node.name()} is not a var'
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(name),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
scale_name = self._quantized_scale_name(name)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
scale_value = np.array(
self._scope.find_var(scale_name).get_tensor()
)
except AttributeError:  # the scale variable does not exist in the scope yet
scale_value = np.zeros([1], dtype=data_type)
scale_var_node = graph.create_persistable_node(
name=scale_name,
var_type=var_node.type(),
shape=[1],
var_dtype=var_node.dtype(),
)
_init_var_node(scale_var_node, scale_value, self._scope, self._place)
quant_op_node = graph.create_op_node(
op_type='fake_quantize_abs_max',
attrs={'bit_length': quant_bits, 'op_role': op_role},
inputs={'X': var_node},
outputs={'Out': quant_var_node, 'OutScale': scale_var_node},
)
graph.link_to(var_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
graph.link_to(quant_op_node, scale_var_node)
return quant_var_node, scale_var_node
def _insert_quant_range_abs_max_op(
self, graph, var_node, name, quant_bits, op_role
):
"""
Insert fake_quantize_range_abs_max op in the graph.
"""
assert var_node.is_var(), f'{var_node.name()} is not a var'
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(name),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
scale_name = self._quantized_scale_name(name)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
scale_value = np.array(
self._scope.find_var(scale_name).get_tensor()
)
except AttributeError:  # the scale variable does not exist in the scope yet
scale_value = np.array([_SCALE_DEFAULT_VALUE], dtype=data_type)
scale_in_node = graph.create_persistable_node(
name=scale_name,
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[1],
var_dtype=var_node.dtype(),
)
_init_var_node(scale_in_node, scale_value, self._scope, self._place)
scale_out_node = graph.create_var_node_from_desc(scale_in_node.var())
inputs = {'X': var_node, 'InScale': scale_in_node}
outputs = {'Out': quant_var_node, 'OutScale': scale_out_node}
if not self._is_test:
# The name of scales_node may be 'scales_0', 'scales_1', etc.
scales_node = graph.create_persistable_node(
name=unique_name.generate('scales'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[self._window_size],
var_dtype=var_node.dtype(),
)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
_init_var_node(
scales_node,
np.zeros([self._window_size], dtype=data_type),
self._scope,
self._place,
)
inputs['Iter'] = self._global_step
outputs['OutScales'] = scales_node
attrs = {
'window_size': self._window_size,
'bit_length': quant_bits,
'is_test': self._is_test,
'op_role': op_role,
}
quant_op_node = graph.create_op_node(
op_type='fake_quantize_range_abs_max',
attrs=attrs,
inputs=inputs,
outputs=outputs,
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_in_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
graph.link_to(quant_op_node, scale_out_node)
if not self._is_test:
graph.link_to(self._global_step, quant_op_node)
graph.link_to(quant_op_node, scales_node)
return quant_var_node, scale_out_node
def _insert_quant_moving_average_abs_max_op(
self, graph, var_node, name, quant_bits, op_role
):
"""Insert fake_quantize_moving_average_abs_max"""
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(name),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
scale_name = self._quantized_scale_name(name)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
scale_value = np.array(
self._scope.find_var(scale_name).get_tensor()
)
except AttributeError:  # the scale variable does not exist in the scope yet
scale_value = np.array([_SCALE_DEFAULT_VALUE], dtype=data_type)
scale_in_node = graph.create_persistable_node(
name=scale_name,
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[1],
var_dtype=var_node.dtype(),
)
_init_var_node(scale_in_node, scale_value, self._scope, self._place)
scale_out_node = graph.create_var_node_from_desc(scale_in_node.var())
ins = {'X': var_node, 'InScale': scale_in_node}
outs = {'Out': quant_var_node, 'OutScale': scale_out_node}
if not self._is_test:
state_in_node = graph.create_persistable_node(
name=unique_name.generate('state'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
_init_var_node(
state_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
accum_in_node = graph.create_persistable_node(
name=unique_name.generate('accum'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
_init_var_node(
accum_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
state_out_node = graph.create_var_node_from_desc(
state_in_node.var()
)
accum_out_node = graph.create_var_node_from_desc(
accum_in_node.var()
)
ins['InState'] = state_in_node
ins['InAccum'] = accum_in_node
outs['OutState'] = state_out_node
outs['OutAccum'] = accum_out_node
attrs = {
'bit_length': quant_bits,
'moving_rate': self._moving_rate,
'is_test': self._is_test,
'op_role': op_role,
}
quant_op_node = graph.create_op_node(
op_type='fake_quantize_moving_average_abs_max',
attrs=attrs,
inputs=ins,
outputs=outs,
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_in_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
graph.link_to(quant_op_node, scale_out_node)
if not self._is_test:
graph.link_to(state_in_node, quant_op_node)
graph.link_to(accum_in_node, quant_op_node)
graph.link_to(quant_op_node, state_out_node)
graph.link_to(quant_op_node, accum_out_node)
return quant_var_node, scale_out_node
def _insert_channel_quant_op(
self, graph, var_node, name, quant_bits, quant_axis, op_role
):
"""
Insert fake_channel_wise_quantize_abs_max op in the graph.
"""
assert var_node.is_var(), f'{var_node.name()} is not a var'
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(name),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
scale_name = self._quantized_scale_name(name)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
scale_value = np.array(
self._scope.find_var(scale_name).get_tensor()
)
except AttributeError:  # the scale variable does not exist in the scope yet
scale_value = np.zeros(
[var_node.shape()[quant_axis]], dtype=data_type
)
scale_var_node = graph.create_persistable_node(
name=scale_name,
var_type=var_node.type(),
shape=[var_node.shape()[quant_axis]],
var_dtype=var_node.dtype(),
)
_init_var_node(scale_var_node, scale_value, self._scope, self._place)
quant_op_node = graph.create_op_node(
op_type='fake_channel_wise_quantize_abs_max',
attrs={
'bit_length': quant_bits,
'quant_axis': quant_axis,
'is_test': self._is_test,
'op_role': op_role,
},
inputs={'X': var_node},
outputs={'Out': quant_var_node, 'OutScale': scale_var_node},
)
graph.link_to(var_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
graph.link_to(quant_op_node, scale_var_node)
return quant_var_node, scale_var_node
def _insert_dequant_op(
self, graph, var_node, scale_var_node, quant_bits, op_role
):
"""
Insert fake_dequantize_op in the graph.
"""
assert var_node.is_var(), f'{var_node.name()} is not a var'
dequant_var_node = graph.create_var_node(
name=self._dequantized_var_name(var_node.name()),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
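# max_range for symmetric signed quantization: 2^(bits-1) - 1, e.g. 127 for 8 bits.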
max_range = (1 << (quant_bits - 1)) - 1
dequant_op_node = graph.create_op_node(
op_type='fake_dequantize_max_abs',
attrs={'max_range': float(max_range), 'op_role': op_role},
inputs={'X': var_node, 'Scale': scale_var_node},
outputs={'Out': dequant_var_node},
)
graph.link_to(var_node, dequant_op_node)
graph.link_to(scale_var_node, dequant_op_node)
graph.link_to(dequant_op_node, dequant_var_node)
return dequant_var_node
def _insert_channel_dequant_op(
self, graph, var_node, scale_var_nodes, quant_bits, quant_axis, op_role
):
"""
Insert fake_channel_wise_dequantize_max_abs op in the graph.
"""
assert var_node.is_var(), f'{var_node.name()} is not a var'
dequant_var_node = graph.create_var_node(
name=self._dequantized_var_name(var_node.name()),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
dequant_op_node = graph.create_op_node(
op_type='fake_channel_wise_dequantize_max_abs',
attrs={
'quant_bits': quant_bits,
'quant_axis': quant_axis,
'op_role': op_role,
},
inputs={'X': var_node, 'Scales': scale_var_nodes},
outputs={'Out': dequant_var_node},
)
graph.link_to(var_node, dequant_op_node)
for scale_n in scale_var_nodes:
graph.link_to(scale_n, dequant_op_node)
graph.link_to(dequant_op_node, dequant_var_node)
return dequant_var_node
def _create_new_node(self, graph, in_node):
"""
create a node that same with in_node in graph
Args:
graph(IrGraph): create node in graph.
in_node(IrVarNode): create node that same with in_node.
Returns:
created new node
"""
key = ''
for inp in in_node.inputs:
key = key + inp.name()
key = key + in_node.name()
for inp in in_node.outputs:
key = key + inp.name()
if key in self.create_var_map.keys():
new_node = self.create_var_map[key]
elif in_node.is_ctrl_var():
new_node = graph.create_control_dep_var()
self.create_var_map[key] = new_node
else:
new_node = graph.create_var_node_from_desc(in_node.node.var())
self.create_var_map[key] = new_node
return new_node
def _copy_graph(self, graph, source_graph, op_node):
"""
copy op_node in source_graph to graph. And will run recursively
for next ops that link to op_node's outputs.
Args:
graph(IrGraph): target graph to copy.
source_graph(IrGraph): source graph to copy.
op_node(IrOpNode): op node in source_graph.
Returns:
None
"""
key = ''
for inp in op_node.inputs:
key = key + inp.name()
key = key + op_node.name()
for inp in op_node.outputs:
key = key + inp.name()
has_created = False
if key in self.create_op_map.keys():
new_op_node = self.create_op_map[key]
has_created = True
else:
new_op_node = graph.create_op_node_from_desc(op_node.node.op())
self.create_op_map[key] = new_op_node
if has_created:
return
for in_node in op_node.inputs:
new_node = self._create_new_node(graph, in_node)
graph.link_to(new_node, new_op_node)
for in_node in op_node.outputs:
new_node = self._create_new_node(graph, in_node)
graph.link_to(new_op_node, new_node)
for var_node in op_node.outputs:
for next_op_node in var_node.outputs:
self._copy_graph(graph, source_graph, next_op_node)
return
def _insert_func(self, graph, func, var_node, op):
"""
Insert a tmp program that returned by func between var_node and op.
Args:
graph(IrGraph): target graph to insert tmp program.
func(Function): function to define a tmp program
var_node(IrVarNode): node in target graph.
op(IrOpNode): op in target graph.
Returns:
op's new input that replaces var_node
"""
tmp_program = Program()
startup_program = Program()
with (
program_guard(tmp_program, startup_program),
tmp_program.switch_name_generator_guard(var_node.name() + "_"),
):
in_node = data(
var_node.name() + '_tmp_input',
shape=var_node.shape(),
dtype='float32',
)
out_node = func(in_node)
graph.out_node_mapping_table[out_node.name] = var_node.name()
# The loss must be a scalar (shape [1]) for minimize().
loss = paddle.mean(out_node)
if not graph._for_test:
assert self._optimizer, (
"optimizer_func must be set when the graph is a training graph"
)
in_node.stop_gradient = False
optimizer = self._optimizer()
optimizer.minimize(loss)
with scope_guard(self._scope):
self._exe.run(startup_program)
tmp_graph = IrGraph(
core.Graph(tmp_program.desc), for_test=graph._for_test
)
in_node = tmp_graph._find_node_by_name(
tmp_graph.all_var_nodes(), in_node.name
)
out_node = tmp_graph._find_node_by_name(
tmp_graph.all_var_nodes(), out_node.name
)
in_node_params = []
in_op_node = []
# Copy tmp_graph into graph; afterwards the copied subgraph can be linked in.
for node in tmp_graph.all_var_nodes():
if node.inputs == [] and node.persistable():
in_node_params.append(node)
for node in tmp_graph.all_op_nodes():
if node.inputs == []:
in_op_node.append(node)
for node in in_node.outputs:
self._copy_graph(graph, tmp_graph, node)
for node in in_node_params:
for op_node in node.outputs:
self._copy_graph(graph, tmp_graph, op_node)
for node in in_op_node:
self._copy_graph(graph, tmp_graph, node)
target_in_node = graph._find_node_by_name(
graph.all_var_nodes(), in_node.name()
)
target_out_node = graph._find_node_by_name(
graph.all_var_nodes(), out_node.name()
)
loss_node = graph._find_node_by_name(graph.all_var_nodes(), loss.name)
outputs = target_in_node.outputs
for node in outputs:
graph.update_input_link(target_in_node, var_node, node)
graph.update_input_link(var_node, target_out_node, op)
# update grad
if not graph._for_test:
op_out = op.outputs[0]
op_out_grad = graph._find_node_by_name(
graph.all_var_nodes(), op_out.name() + "@GRAD"
)
# find op's gradient op, such as conv2d_grad
op_grad = op_out_grad.outputs[0]
target_out_grad_node = graph._find_node_by_name(
graph.all_var_nodes(), target_out_node.name() + "@GRAD"
)
in_node_grad = graph._find_node_by_name(
graph.all_var_nodes(), target_in_node.name() + "@GRAD"
)
in_node_grad_op = in_node_grad.inputs
# update op_grad's input
graph.update_input_link(var_node, target_out_node, op_grad)
op_grad_out = None
# find var_node's corresponding grad node
for node in op_grad.outputs:
if var_node.name() + "@GRAD" in node.name():
op_grad_out = node
# update op_grad's output
if op_grad_out is not None:
graph.update_output_link(
op_grad_out, target_out_grad_node, op_grad
)
else:
graph.link_to(op_grad, target_out_grad_node)
for node in in_node_grad_op:
graph.update_input_link(target_in_node, var_node, node)
if op_grad_out:
graph.update_output_link(in_node_grad, op_grad_out, node)
# remove useless nodes
mean_grad = target_out_grad_node.inputs[0]
mean_out_grad = mean_grad.inputs[0]
fill_constant_node = mean_out_grad.inputs[0]
graph.safe_remove_nodes(mean_grad)
graph.safe_remove_nodes(mean_out_grad)
graph.safe_remove_nodes(fill_constant_node)
graph.safe_remove_nodes(in_node_grad)
graph.safe_remove_nodes(loss_node.inputs[0])
graph.safe_remove_nodes(loss_node)
graph.safe_remove_nodes(target_in_node)
return target_out_node
def _quantized_var_name(self, var_name):
"""
Return quantized variable name for the input `var_name`.
"""
return f"{var_name}.quantized"
def _dequantized_var_name(self, var_name):
"""
Return dequantized variable name for the input `var_name`.
"""
return f"{var_name}.dequantized"
def _quantized_scale_name(self, var_name):
"""
Return the scale name of quantized variable for the input `var_name`.
"""
return f"{var_name}@scale"
def _is_skip_quant(self, graph, op_node):
"""
Analyse whether the op node should skip quantization.
"""
is_skip = False
if op_node.op().has_attr("skip_quant") and op_node.op().attr(
"skip_quant"
):
is_skip = True
# if the inputs of mul and matmul are not all persistable, use
# AddQuantDequantPass to quantize them.
if op_node.name() in [
"mul",
"matmul",
] and _is_input_all_not_persistable(graph, op_node):
is_skip = True
if (
op_node.op().has_attr("quantization_type")
and op_node.op().attr("quantization_type") == "qat_without_weight"
):
is_skip = True
return is_skip
class QuantizationFreezePass:
def __init__(
self,
scope,
place,
bias_correction=False,
weight_bits=8,
activation_bits=8,
round_type='round',
weight_quantize_type='abs_max',
quantizable_op_type=None,
):
"""
The freeze pass is used to adjust the quantize operator order, for example:
1) `activation -> quant -> dequant -> conv2d` will be frozen into
`activation -> quant -> conv2d -> dequant`
2) `weight -> quant -> dequant -> conv2d` will be frozen into `weight -> conv2d`,
and weight will be scaled offline.
Args:
scope(static.Scope): scope is used to get the weight tensor values.
place(static.CPUPlace|static.CUDAPlace|str): place is used to restore the weight tensors.
If it is a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the index of the GPUs.
bias_correction(bool): whether use bias correction for post-training quantization.
https://arxiv.org/abs/1810.05723.
weight_bits(int): quantization bit number for weights.
activation_bits(int): quantization bit number for activation.
round_type(str, optional): The method of converting the quantized weight
values from float to int. Currently supports the ['round', 'adaround'] methods.
Default is `round`, which rounds to the nearest integer.
'adaround' refers to https://arxiv.org/abs/2004.10568.
weight_quantize_type(str): quantization type for weights, support 'abs_max' and
'channel_wise_abs_max'. The 'range_abs_max' usually is not used for weight,
since weights are fixed once the model is well trained.
quantizable_op_type(list[str]): This input param will be removed later. The pass
processes all quantized ops, so it is not necessary to set this param.
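Examples:
A minimal usage sketch; it assumes QuantizationFreezePass is importable
alongside QuantizationTransformPass, and that ``graph`` is a trained
IrGraph produced by a quant-aware training run:
.. code-block:: pycon
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import QuantizationFreezePass
>>> # `graph` below is assumed to come from QuantizationTransformPass.
>>> freeze_pass = QuantizationFreezePass(
...     scope=static.global_scope(), place=paddle.CPUPlace()
... )
>>> graph = freeze_pass.apply(graph)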
"""
assert scope is not None, 'The scope cannot be None.'
assert place is not None, 'The place cannot be None.'
self._scope = scope
self._bias_correction = bias_correction
self._place = _get_paddle_place(place)
self._weight_bits = weight_bits
self._activation_bits = activation_bits
self._round_type = round_type
self._weight_quantize_type = weight_quantize_type
self._fake_quant_op_names = _fake_quant_op_list
self._fake_dequant_op_names = _fake_dequant_op_list
self._op_input_rename_map = collections.OrderedDict()
self._op_output_rename_map = collections.OrderedDict()
self._quant_var_scale_map = collections.OrderedDict()
self._quantized_ops = set()
def apply(self, graph):
"""
Adjust quantize/dequantize operators order for the inference process.
Args:
graph(IrGraph): the applied graph.
Returns:
None
"""
# Get input scales in fake quant op and process weights
persistable_vars = [p.name() for p in graph.all_persistable_nodes()]
ops = graph.all_op_nodes()
for op_node in ops:
op_name = op_node.name()
if op_name in self._fake_quant_op_names:
input_arg_name = op_node.input('X')[0]
if hasattr(graph, 'out_node_mapping_table'):
if input_arg_name in graph.out_node_mapping_table.keys():
input_arg_name = graph.out_node_mapping_table[
input_arg_name
]
if input_arg_name not in persistable_vars:
scale_v = graph._find_node_by_name(
op_node.outputs, op_node.output('OutScale')[0]
)
self._quant_var_scale_map[input_arg_name] = scale_v
else:
# Obtain scale from OutScale var node
scale_v = self._load_var(op_node.output('OutScale')[0])
assert scale_v.ndim in [
1,
2,
], "the dim of scale_v should be 1 or 2"
if scale_v.ndim == 2:
scale_v = scale_v[0]
if (
scale_v.size == 1
and self._weight_quantize_type == 'abs_max'
):
scale_v = scale_v[0]
else:
scale_v = scale_v.tolist()
self._quant_var_scale_map[input_arg_name] = scale_v
# Quantize weight and restore
if self._round_type == 'round':
param_v = self._load_var(input_arg_name)
quant_axis = 0
if op_node.op().has_attr('quant_axis'):
quant_axis = op_node.op().attr('quant_axis')
if input_arg_name not in self._quantized_ops:
self._quantized_ops.add(input_arg_name)
quantized_param_v = utils.quant_tensor(
param_v.copy(),
scale_v,
quant_axis,
self._weight_bits,
)
quantized_param_v = np.round(quantized_param_v)
# Weight bias correction
if self._bias_correction is True:
quantized_param_v = utils.bias_correction_w(
param_v,
quantized_param_v,
scale_v,
quant_axis,
weight_bits=self._weight_bits,
)
quantized_param_v = np.round(quantized_param_v)
self._restore_var(input_arg_name, quantized_param_v)
self._remove_fake_quant_and_dequant_op(graph, op_node)
# Remove all fake dequant op
ops = graph.all_op_nodes()
for op_node in ops:
op_name = op_node.name()
if op_name in self._fake_dequant_op_names:
self._remove_fake_quant_and_dequant_op(graph, op_node)
# Insert post dequant op
ops = graph.all_op_nodes()
for op_node in ops:
op_node_desc = op_node.op()
if (
op_node_desc.has_attr("quantization_type")
and op_node_desc.attr("quantization_type") == "qat_with_weight"
):
if self._weight_quantize_type == 'channel_wise_abs_max':
quant_axis = (
1
if op_node.name() in utils._channelwise_quant_axis1_ops
else 0
)
self._insert_post_channel_dequant_op(
graph, op_node, quant_axis
)
else:
self._insert_post_dequant_op(graph, op_node)
# Rename the inputs of the following ops after inserting dequant_op after fc/conv
for op_node in ops:
for var_node in op_node.inputs:
if var_node.node in self._op_output_rename_map:
old_in = var_node
new_in = self._op_output_rename_map[var_node.node]
graph.update_input_link(old_in, new_in, op_node)
# remove the unused var node in the graph
self._remove_unused_var_nodes(graph)
graph.resolve_hazard()
return graph
def _remove_fake_quant_and_dequant_op(self, graph, op_node):
k = graph._find_node_by_name(op_node.outputs, op_node.output('Out')[0])
v = graph._find_node_by_name(op_node.inputs, op_node.input('X')[0])
if v.node not in self._op_input_rename_map:
self._op_input_rename_map[k.node] = v
else:
self._op_input_rename_map[k.node] = self._op_input_rename_map[
v.node
]
graph.safe_remove_nodes(op_node)
def _insert_post_channel_dequant_op(self, graph, op_node, quant_axis):
persistable_vars = [p.name() for p in graph.all_persistable_nodes()]
for var_node in op_node.inputs:
name = var_node.name()
if name not in op_node.input_arg_names():
continue
if var_node.node in self._op_input_rename_map:
old_in = var_node
new_in = self._op_input_rename_map[var_node.node]
new_in.clear_outputs()
graph.update_input_link(old_in, new_in, op_node)
original_var_name = self._original_var_name(name)
scale_v = self._quant_var_scale_map[original_var_name]
if original_var_name in persistable_vars:
assert isinstance(scale_v, list), (
f'The scale of parameter {original_var_name} is not a list.'
)
channel_scale = np.array(scale_v)
else:
assert isinstance(scale_v, IrNode)
scale_var_node = self._quant_var_scale_map[original_var_name]
if len(op_node.output_arg_names()) != 1:
raise ValueError(
f"Only support one output, but op {op_node.name()} has"
" more than one output."
)
output_var_node = graph._find_node_by_name(
op_node.outputs, op_node.output_arg_names()[0]
)
weight_scale_node = graph.create_persistable_node(
name=unique_name.generate('channel_scale'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[channel_scale.shape[0]],
var_dtype=output_var_node.dtype(),
)
if output_var_node.dtype() == paddle.float64:
data_type = 'float64'
elif output_var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
_init_var_node(
weight_scale_node,
channel_scale.astype(data_type),
self._scope,
self._place,
)
dequant_var_node = graph.create_var_node(
name=self._dequantized_var_name(output_var_node.name()),
var_type=output_var_node.type(),
shape=output_var_node.shape(),
var_dtype=output_var_node.dtype(),
)
x_num_col_dims = 1
if op_node.name() in ['matmul', 'matmul_v2', 'mul']:
x_num_col_dims = len(op_node.outputs[0].shape()) - 1
if op_node.op().has_attr("x_num_col_dims"):
x_num_col_dims = op_node.op().attr("x_num_col_dims")
dequant_op_node = graph.create_op_node(
op_type='fake_channel_wise_dequantize_max_abs',
attrs={
'quant_bits': [self._weight_bits, self._activation_bits],
'quant_axis': quant_axis,
'op_role': core.op_proto_and_checker_maker.OpRole.Forward,
'x_num_col_dims': x_num_col_dims,
},
inputs={
'X': output_var_node,
'Scales': [weight_scale_node, scale_var_node],
},
outputs={'Out': dequant_var_node},
)
graph.link_to(output_var_node, dequant_op_node)
graph.link_to(scale_var_node, dequant_op_node)
graph.link_to(weight_scale_node, dequant_op_node)
graph.link_to(dequant_op_node, dequant_var_node)
self._op_output_rename_map[output_var_node.node] = dequant_var_node
return dequant_var_node
def _insert_post_dequant_op(self, graph, op_node):
persistable_vars = [p.name() for p in graph.all_persistable_nodes()]
max_range = 1
param_range = (1 << (self._weight_bits - 1)) - 1
act_range = (1 << (self._activation_bits - 1)) - 1
for var_node in op_node.inputs:
name = var_node.name()
if name not in op_node.input_arg_names():
continue
if var_node.node in self._op_input_rename_map:
old_in = var_node
new_in = self._op_input_rename_map[var_node.node]
new_in.clear_outputs()
graph.update_input_link(old_in, new_in, op_node)
original_var_name = self._original_var_name(name)
scale_v = self._quant_var_scale_map[original_var_name]
if original_var_name in persistable_vars:
assert self._is_float(scale_v), (
f'The scale of parameter {original_var_name} is not a float.'
)
scale_v = 1e-8 if scale_v == 0.0 else scale_v
max_range *= param_range / scale_v
else:
max_range *= act_range
assert isinstance(scale_v, IrNode)
scale_var_node = self._quant_var_scale_map[original_var_name]
if len(op_node.output_arg_names()) != 1:
raise ValueError(
f"Only support one output, but op {op_node.name()} has"
" more than one output."
)
output_var_node = graph._find_node_by_name(
op_node.outputs, op_node.output_arg_names()[0]
)
dequant_var_node = graph.create_var_node(
name=self._dequantized_var_name(output_var_node.name()),
var_type=output_var_node.type(),
shape=output_var_node.shape(),
var_dtype=output_var_node.dtype(),
)
dequant_op_node = graph.create_op_node(
op_type='fake_dequantize_max_abs',
attrs={
'max_range': float(max_range),
'op_role': core.op_proto_and_checker_maker.OpRole.Forward,
},
inputs={'X': output_var_node, 'Scale': scale_var_node},
outputs={'Out': dequant_var_node},
)
graph.link_to(output_var_node, dequant_op_node)
graph.link_to(scale_var_node, dequant_op_node)
graph.link_to(dequant_op_node, dequant_var_node)
self._op_output_rename_map[output_var_node.node] = dequant_var_node
return dequant_var_node
def _load_var(self, name):
return np.array(self._scope.find_var(name).get_tensor())
def _restore_var(self, name, array):
tensor = self._scope.find_var(name).get_tensor()
tensor.set(array, self._place)
def _remove_unused_var_nodes(self, graph):
all_used_vars = set()
ops = graph.all_op_nodes()
for op_node in ops:
for input_node in op_node.inputs:
all_used_vars.add(input_node)
for output_node in op_node.outputs:
all_used_vars.add(output_node)
all_used_vars = {n.node for n in all_used_vars}
all_unused_vars = set(
filter(
lambda node: node.node not in all_used_vars,
graph.all_var_nodes(),
)
)
graph.safe_remove_nodes(all_unused_vars)
def _original_var_name(self, var_name):
"""
Return the original variable name.
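For example, a hypothetical name 'conv1.w.quantized.dequantized' maps back to 'conv1.w'.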
"""
if var_name.endswith('.quantized.dequantized'):
return var_name[: -len('.quantized.dequantized')]
if var_name.endswith('.quantized'):
return var_name[: -len('.quantized')]
if var_name.endswith('.dequantized'):
return var_name[: -len('.dequantized')]
if var_name.endswith('@scale'):
return var_name[: -len('@scale')]
else:
return var_name
def _dequantized_var_name(self, var_name):
"""
Return dequantized variable name for the input `var_name`.
"""
return f"{var_name}.dequantized"
def _is_float(self, v):
return isinstance(v, (float, np.float16, np.float32, np.float64))
class ConvertToInt8Pass:
def __init__(self, scope, place, quantizable_op_type=None):
"""
Convert the weights into int8_t type.
Args:
scope(static.Scope): scope is used to get the weight tensor values.
place(static.CPUPlace|static.CUDAPlace|str): place is used to restore the
8-bit weight tensors. If it is a string, it can be ``cpu`` or ``gpu:x``,
where ``x`` is the index of the GPUs.
quantizable_op_type(list[str]): This input param will be removed later. The pass
processes all quantized ops, so it is not necessary to set this param.
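Examples:
A minimal usage sketch; it assumes ConvertToInt8Pass is importable
alongside QuantizationTransformPass, and that ``graph`` has already been
processed by QuantizationFreezePass:
.. code-block:: pycon
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import ConvertToInt8Pass
>>> # `graph` below is assumed to be a frozen quantized IrGraph.
>>> convert_pass = ConvertToInt8Pass(static.global_scope(), paddle.CPUPlace())
>>> graph = convert_pass.apply(graph)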
"""
assert scope is not None, 'The scope cannot be None.'
assert place is not None, 'The place cannot be None.'
self._scope = scope
self._place = _get_paddle_place(place)
def apply(self, graph):
"""
Convert weights' type of the graph. After that, the data type of the
graph weights is int8_t.
Args:
graph(IrGraph): the applied graph.
Returns:
None
"""
persistable_vars = [p.name() for p in graph.all_persistable_nodes()]
ops = graph.all_op_nodes()
input_map = {}
for op_node in ops:
if (
op_node.op().has_attr("quantization_type")
and op_node.op().attr("quantization_type") == "qat_with_weight"
):
for var_node in op_node.inputs:
name = var_node.name()
if name in persistable_vars:
if name not in input_map:
int8_var_node = self._convert_to_int8(
graph, var_node
)
input_map[name] = int8_var_node
graph.update_input_link(
var_node, input_map[name], op_node
)
# remove the unused var node in the graph
self._remove_unused_var_nodes(graph)
graph.resolve_hazard()
return graph
def _convert_to_int8(self, graph, var_node):
int8_var_node_name = var_node.name() + ".int8"
int8_var_node = graph.create_persistable_node(
name=int8_var_node_name,
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=core.VarDesc.VarType.INT8,
)
array = self._load_var(var_node.name())
self._scope.var(int8_var_node_name)
self._store_var(int8_var_node_name, array, np.int8)
return int8_var_node
def _load_var(self, name):
return np.array(self._scope.find_var(name).get_tensor())
def _store_var(self, name, array, dtype):
tensor = self._scope.find_var(name).get_tensor()
tensor.set(array.astype(dtype), self._place)
def _remove_unused_var_nodes(self, graph):
all_used_vars = set()
ops = graph.all_op_nodes()
for op_node in ops:
for input_node in op_node.inputs:
all_used_vars.add(input_node)
for output_node in op_node.outputs:
all_used_vars.add(output_node)
all_used_vars = {n.node for n in all_used_vars}
all_unused_vars = set(
filter(
lambda node: node.node not in all_used_vars,
graph.all_var_nodes(),
)
)
graph.safe_remove_nodes(all_unused_vars)
class TransformForMobilePass:
def __init__(self):
"""
This pass is used to convert the frozen graph for paddle-mobile execution.
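Examples:
A minimal usage sketch; it assumes TransformForMobilePass is importable
alongside QuantizationTransformPass, and that ``graph`` is a frozen
quantized IrGraph:
.. code-block:: pycon
>>> from paddle.static.quantization import TransformForMobilePass
>>> # `graph` below is assumed to be the output of QuantizationFreezePass.
>>> mobile_pass = TransformForMobilePass()
>>> graph = mobile_pass.apply(graph)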
"""
self._fake_quant_op_names = _fake_quant_op_list
self._fake_dequant_op_names = _fake_dequant_op_list
def apply(self, graph):
"""
Because paddle-mobile uses `quantize` and `dequantize` as the names of
its quantize and dequantize operators, the `apply` function simply
renames the corresponding ops.
Args:
graph(IrGraph): the graph will be transformed.
Returns:
None
"""
ops = graph.all_op_nodes()
for op_node in ops:
name = op_node.name()
if name in self._fake_quant_op_names:
op_node.set_type('quantize')
quant_node = graph.create_op_node_from_desc(op_node.op())
for input_node in op_node.inputs:
graph.link_to(input_node, quant_node)
for output_node in op_node.outputs:
graph.link_to(quant_node, output_node)
graph.safe_remove_nodes(op_node)
if name in self._fake_dequant_op_names:
op_node.set_type('dequantize')
dequant_node = graph.create_op_node_from_desc(op_node.op())
for input_node in op_node.inputs:
graph.link_to(input_node, dequant_node)
for output_node in op_node.outputs:
graph.link_to(dequant_node, output_node)
graph.safe_remove_nodes(op_node)
graph.resolve_hazard()
return graph
class OutScaleForTrainingPass:
def __init__(
self,
scope=None,
place=None,
moving_rate=0.9,
is_test=None,
scale_dict=None,
):
"""
This pass is used for calculating output scales of some operators.
These output scales may be used by tensorRT or some other inference engines.
Args:
scope(static.Scope): The scope is used to initialize these new parameters.
place(static.CPUPlace|static.CUDAPlace|str): The place is used to initialize new parameters.
If it is a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the
index of the GPUs.
moving_rate(float): The decay coefficient of moving average. The default value is 0.9.
is_test(bool, optional): Whether the graph is a test graph. If None, it is
inferred from the graph. Default is None.
scale_dict(dict, optional): A dict of pre-computed output scales keyed by
variable name, used to initialize the scale variables. Default is None.
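Examples:
A minimal usage sketch; it assumes OutScaleForTrainingPass is importable
alongside QuantizationTransformPass, and that ``graph`` is a training
IrGraph:
.. code-block:: pycon
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import OutScaleForTrainingPass
>>> # `graph` below is assumed to be an IrGraph built for training.
>>> out_scale_pass = OutScaleForTrainingPass(
...     scope=static.global_scope(), place=paddle.CPUPlace()
... )
>>> graph = out_scale_pass.apply(graph)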
"""
self._scope = scope
self._place = _get_paddle_place(place)
self._moving_rate = moving_rate
self._is_test = is_test
self._teller_set = list(SUPPORT_QUANTIZATION_OP_DICT.keys())
self._scale_dict = scale_dict
def apply(self, graph):
"""
Insert the `moving_average_abs_max_scale` op in order to calculate output scales
of operators in the teller_set.
Args:
graph(IrGraph): the target graph.
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
if self._is_test is None:
self._is_test = graph.is_test()
target_ops = []
for op in graph.all_op_nodes():
if op.name() in self._teller_set:
target_ops.append(op)
with tqdm(
total=len(target_ops),
bar_format='Adding OutScale op:|{bar}| {n_fmt}/{total_fmt}',
ncols=80,
) as t:
for op in target_ops:
for output_var_name in utils._get_op_output_var_names(op):
in_node = graph._find_node_by_name(
op.outputs, output_var_name
)
if (
in_node.dtype()
not in [
core.VarDesc.VarType.FP64,
core.VarDesc.VarType.FP32,
core.VarDesc.VarType.FP16,
]
or '@GRAD' in in_node.name()
):
continue
if in_node.dtype() == paddle.float64:
data_type = 'float64'
elif in_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
graph._find_node_by_name(
graph.all_var_nodes(),
self._scale_name(in_node.name()),
)
continue
except Exception:  # the scale node does not exist in the graph yet
scale_node = graph.create_persistable_node(
name=self._scale_name(in_node.name()),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[1],
var_dtype=in_node.dtype(),
)
if self._scale_dict is not None:
try:
scale_value = np.array(
[self._scale_dict[in_node.name()]]
)
except KeyError:  # no pre-computed scale for this variable
scale_value = np.ones([1], dtype=data_type)
else:
scale_value = np.ones([1], dtype=data_type)
_init_var_node(
scale_node, scale_value, self._scope, self._place
)
ins = {'X': in_node}
outs = {'OutScale': scale_node}
if not self._is_test:
state_in_node = graph.create_persistable_node(
name=unique_name.generate('scale_state@'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=in_node.dtype(),
shape=[1],
)
_init_var_node(
state_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
accum_in_node = graph.create_persistable_node(
name=unique_name.generate('scale_accum@'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=in_node.dtype(),
shape=[1],
)
_init_var_node(
accum_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
state_out_node = graph.create_var_node_from_desc(
state_in_node.var()
)
accum_out_node = graph.create_var_node_from_desc(
accum_in_node.var()
)
ins['InState'] = state_in_node
ins['InAccum'] = accum_in_node
outs['OutState'] = state_out_node
outs['OutAccum'] = accum_out_node
attrs = {
'moving_rate': self._moving_rate,
'is_test': self._is_test,
'op_role': op.op().attr("op_role"),
}
scale_op_node = graph.create_op_node(
op_type='moving_average_abs_max_scale',
attrs=attrs,
inputs=ins,
outputs=outs,
)
next_op_node = None
if len(in_node.outputs) > 0:
next_op_node = in_node.outputs[0]
graph.link_to(in_node, scale_op_node)
graph.link_to(scale_op_node, scale_node)
if next_op_node:
graph.link_to(scale_node, next_op_node)
if not self._is_test:
graph.link_to(state_in_node, scale_op_node)
graph.link_to(accum_in_node, scale_op_node)
graph.link_to(scale_op_node, state_out_node)
graph.link_to(scale_op_node, accum_out_node)
t.update()
return graph
def _scale_name(self, var_name):
"""
Return the scale name for the var named `var_name`.
"""
return f"{var_name}@scale"
class OutScaleForInferencePass:
def __init__(self, scope=None):
"""
This pass is used for setting output scales of some operators.
These output scales may be used by tensorRT or some other inference engines.
Args:
scope(static.Scope): The scope is used to initialize these new parameters.
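Examples:
A minimal usage sketch; it assumes OutScaleForInferencePass is importable
alongside QuantizationTransformPass, and that ``graph`` is a test IrGraph
whose scales were produced by OutScaleForTrainingPass:
.. code-block:: pycon
>>> import paddle.static as static
>>> from paddle.static.quantization import OutScaleForInferencePass
>>> # `graph` below is assumed to be an IrGraph built for inference.
>>> out_scale_infer_pass = OutScaleForInferencePass(scope=static.global_scope())
>>> graph = out_scale_infer_pass.apply(graph)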
"""
self._scope = scope
self._teller_set = list(SUPPORT_QUANTIZATION_OP_DICT.keys())
def apply(self, graph):
"""
Get output scales from the scope and set these scales in op_descs
of operators in the teller_set.
Args:
graph(IrGraph): the target graph.
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
op_nodes = graph.all_op_nodes()
for op_node in op_nodes:
if op_node.name() in self._teller_set:
var_names = utils._get_op_output_var_names(op_node)
for var_name in var_names:
in_node = graph._find_node_by_name(
op_node.outputs, var_name
)
if (in_node.node.var() is None) or (
in_node.dtype()
not in [
core.VarDesc.VarType.FP64,
core.VarDesc.VarType.FP32,
core.VarDesc.VarType.FP16,
]
):
continue
scale_name = self._scale_name(var_name)
scale_var = self._scope.find_var(scale_name)
assert scale_var is not None, (
f"Can not find {scale_name} variable in the scope"
)
scale_value = np.array(scale_var.get_tensor())[0]
# For compatibility, we save output threshold by two methods.
op_node.op()._set_attr("out_threshold", float(scale_value))
argname_index = utils._get_output_name_index(
op_node, var_name
)
assert argname_index is not None, (
var_name + " is not the output of the op"
)
op_node.op()._set_attr(
argname_index[0] + str(argname_index[1]) + "_threshold",
float(scale_value),
)
op_node.op()._set_attr("with_quant_attr", True)
graph.resolve_hazard()
return graph
def _scale_name(self, var_name):
"""
Return the scale name for the var named `var_name`.
"""
return f"{var_name}@scale"
class AddQuantDequantPass:
"""
Quantize the ops that do not have weights, and add quant_dequant ops for the
quantized ops' inputs.
"""
# To be compatible with PaddleSlim, not remove _activation_type for now
_activation_type = ["relu", "relu6", "leaky_relu", "tanh", "swish"]
def __init__(
self,
scope=None,
place=None,
moving_rate=0.9,
quant_bits=8,
skip_pattern=["skip_quant"],
quantizable_op_type=["elementwise_add", "pool2d"],
is_test=None,
scale_dict=None,
):
"""
Constructor.
Args:
scope(static.Scope): The scope is used to initialize these new parameters.
place(static.CPUPlace|static.CUDAPlace|str): place is used to initialize new
                parameters described above. If ``place`` is a string, it can be
                ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
moving_rate(float, optional): the param for 'quant_dequant_moving_average_abs_max'
quantization. Default is 0.9.
quant_bits(int, optional): quantization bit number for activation. Default is 8.
skip_pattern(str, optional): The user-defined quantization skip pattern, which
will be presented in the name scope of an op. When the skip pattern is
detected in an op's name scope, the corresponding op will not be quantized.
Default is 'skip_quant'.
quantizable_op_type(list[str], optional): List the type of ops that will be
quantized. Default is ["elementwise_add", "pool2d"].
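        Examples:
            .. code-block:: pycon
                >>> # A minimal sketch (the import path is an assumption); the
                >>> # original graph will be rewritten in place.
                >>> import paddle
                >>> from paddle.static.quantization import AddQuantDequantPass
                >>> from paddle.base.framework import IrGraph
                >>> from paddle.framework import core
                >>> graph = IrGraph(core.Graph(paddle.static.Program().desc), for_test=False)
                >>> place = paddle.CPUPlace()
                >>> scope = paddle.static.global_scope()
                >>> add_quant_dequant_pass = AddQuantDequantPass(scope, place)
                >>> add_quant_dequant_pass.apply(graph)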
"""
self._scope = scope
self._place = _get_paddle_place(place)
self._moving_rate = moving_rate
self._quant_bits = quant_bits
self._is_test = is_test
self._skip_pattern = skip_pattern
self._scale_dict = scale_dict
self._quantizable_op_type = quantizable_op_type
for op_type in self._quantizable_op_type:
assert op_type in list(SUPPORT_ACT_QUANTIZATION_OP_DICT.keys()), (
op_type + " is not supported for quantization."
)
self._quantizable_grad_op_type = [
f'{op}_grad' for op in self._quantizable_op_type
]
assert self._scope is not None, "scope must not be None."
assert self._place is not None, "place must not be None."
def apply(self, graph):
"""
Add quant_dequant before some ops, such as the 'elementwise_add' and
'pool2d' op.
Args:
graph(IrGraph): the target graph.
Returns:
None
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
if self._is_test is None:
self._is_test = graph.is_test()
dequantized_vars_map = collections.OrderedDict()
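        # Map each quantized input var name to its quant_dequant output, so a var
        # feeding several consumers is only instrumented once.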
# Forward stage, insert quant_dequant op
all_op_nodes = graph.all_op_nodes()
with tqdm(
total=len(all_op_nodes),
bar_format='Adding quant activation op:|{bar}| {n_fmt}/{total_fmt}',
ncols=80,
) as t:
for op_node in all_op_nodes:
if op_node.name() in self._quantizable_op_type:
is_skip = False
if isinstance(self._skip_pattern, list):
is_skip = op_node.op().has_attr("op_namescope") and any(
pattern in op_node.op().attr("op_namescope")
for pattern in self._skip_pattern
)
elif isinstance(self._skip_pattern, str):
is_skip = (
op_node.op().has_attr("op_namescope")
and op_node.op()
.attr("op_namescope")
.find(self._skip_pattern)
!= -1
)
is_quantized = (
op_node.op().has_attr("quantization_type")
and op_node.op().attr("quantization_type")
== "qat_with_weight"
)
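                    # Skip ops marked to be skipped, ops already quantized with
                    # weights, and ops that take any persistable (weight) input.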
if (
is_skip
or is_quantized
or (not _is_input_all_not_persistable(graph, op_node))
):
continue
op_node.op()._set_attr(
"quantization_type", "qat_without_weight"
)
op_node.op()._set_attr("activation_bits", self._quant_bits)
op_node.op()._set_attr("with_quant_attr", True)
arg_names = utils._get_op_input_var_names(op_node)
                    # If already quantized, skip it.
skip_quant = False
for arg_name in arg_names:
if "quantized.dequantized" in arg_name:
skip_quant = True
break
if skip_quant:
continue
for arg_name in arg_names:
in_node = graph._find_node_by_name(
op_node.inputs, arg_name
)
if arg_name in dequantized_vars_map:
quant_var_node = dequantized_vars_map[arg_name]
else:
(
quant_var_node,
_,
) = self._insert_quant_dequant_moving_average_abs_max_op(
graph,
in_node,
self._quant_bits,
op_node.op().attr("op_role"),
)
dequantized_vars_map[arg_name] = quant_var_node
graph.update_input_link(
in_node, quant_var_node, op_node
)
t.update()
# Backward stage, update input link
for op_node in all_op_nodes:
if op_node.name() in self._quantizable_grad_op_type:
for input_name in op_node.input_arg_names():
if input_name in dequantized_vars_map:
in_node = graph._find_node_by_name(
op_node.inputs, input_name
)
dequant_var_node = dequantized_vars_map[input_name]
graph.update_input_link(
in_node, dequant_var_node, op_node
)
graph.resolve_hazard()
return graph
def _insert_quant_dequant_moving_average_abs_max_op(
self, graph, var_node, quant_bits, op_role
):
"""Insert fake_quantize_dequantize_moving_average_abs_max op."""
quant_var_node = graph.create_var_node(
name=f"{var_node.name()}.quant_dequant",
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
scale_name = f"{var_node.name()}.quant_dequant@scale"
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
try:
if (
self._scale_dict is not None
and var_node.name() in self._scale_dict.keys()
):
scale_value = np.array(
[self._scale_dict[var_node.name()]], dtype=data_type
)
else:
scale_value = np.array(
self._scope.find_var(scale_name).get_tensor(),
dtype=data_type,
)
except:
scale_value = np.array([_SCALE_DEFAULT_VALUE], dtype=data_type)
scale_in_node = graph.create_persistable_node(
name=f"{var_node.name()}.quant_dequant@scale",
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=[1],
var_dtype=var_node.dtype(),
)
_init_var_node(scale_in_node, scale_value, self._scope, self._place)
scale_out_node = graph.create_var_node_from_desc(scale_in_node.var())
ins = {'X': var_node, 'InScale': scale_in_node}
outs = {'Out': quant_var_node, 'OutScale': scale_out_node}
if not self._is_test:
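            # In training mode, the op maintains running statistics:
            #   state = rate * state + 1, accum = rate * accum + max(|x|),
            # and the scale is accum / state, a moving average of per-batch
            # abs-max values.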
state_in_node = graph.create_persistable_node(
name=unique_name.generate('quant_dequant.state'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
_init_var_node(
state_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
accum_in_node = graph.create_persistable_node(
name=unique_name.generate('quant_dequant.accum'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
_init_var_node(
accum_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
state_out_node = graph.create_var_node_from_desc(
state_in_node.var()
)
accum_out_node = graph.create_var_node_from_desc(
accum_in_node.var()
)
ins['InState'] = state_in_node
ins['InAccum'] = accum_in_node
outs['OutState'] = state_out_node
outs['OutAccum'] = accum_out_node
attrs = {
'bit_length': quant_bits,
'moving_rate': self._moving_rate,
'is_test': self._is_test,
'op_role': op_role,
}
quant_op_node = graph.create_op_node(
op_type='fake_quantize_dequantize_moving_average_abs_max',
attrs=attrs,
inputs=ins,
outputs=outs,
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_in_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
graph.link_to(quant_op_node, scale_out_node)
if not self._is_test:
graph.link_to(state_in_node, quant_op_node)
graph.link_to(accum_in_node, quant_op_node)
graph.link_to(quant_op_node, state_out_node)
graph.link_to(quant_op_node, accum_out_node)
return quant_var_node, scale_out_node
class InsertQuantizeLinear:
"""
    Insert quantize_linear and dequantize_linear ops before the quantized ops.
Args:
place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to restore the weight tensors.
            If it's a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
scope(paddle.Scope): scope is used to get the weight tensor values.
quant_bits(int, optional): quantization bit number for weight. Default is 8.
        quant_axis(int, optional): the channel dimension used for quantization. When it is
            greater than or equal to 0, quantization is per-channel along that axis;
            otherwise it is per-layer. Default is -1.
        channel_wise(bool, optional): Whether to quantize per channel. Default is False.
        moving_rate(float): the rate for the 'moving average' method.
        is_test(bool, optional): Whether the pass runs in inference (test) mode rather
            than training. Default is True.
        scale_dict(dict, optional): calibration ranges of output tensors.
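    Examples:
        .. code-block:: pycon
            >>> # A minimal sketch of the intended call sequence; ``graph``,
            >>> # ``var_node`` and ``op_role`` are assumed to come from an
            >>> # existing IrGraph traversal, as in the passes below.
            >>> insert_quant_pass = InsertQuantizeLinear(place, scope, quant_bits=8)
            >>> quant_var_node, scale_var_node = insert_quant_pass.insert_quant_op(
            ...     graph, var_node, op_role=op_role)
            >>> dequant_var_node = insert_quant_pass.insert_dequant_op(
            ...     graph, quant_var_node, scale_var_node, op_role)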
"""
def __init__(
self,
place,
scope,
quant_bits=8,
quant_axis=-1,
channel_wise=False,
moving_rate=0.9,
is_test=True,
scale_dict=None,
):
self._place = place
self._scope = scope
self.quant_bits = quant_bits
self.quant_axis = quant_axis
self.channel_wise = channel_wise
self._is_test = is_test
self._moving_rate = moving_rate
self._scale_dict = scale_dict
def insert_quant_op(
self,
graph,
var_node,
var_name=None,
scale_var_node=None,
op_role=core.op_proto_and_checker_maker.OpRole.Forward,
):
assert var_node.is_var(), f'{var_node.name()} is not a var'
var_name = var_node.name() if not var_name else var_name
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(var_name),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
if not scale_var_node:
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
scale_name = self._quantized_scale_name(var_name)
if self.channel_wise:
scale_var_shape = var_node.shape()[self.quant_axis]
scale_var_type = core.VarDesc.VarType.DENSE_TENSOR
init_scale_value = (
np.ones(scale_var_shape, dtype=data_type)
* _SCALE_DEFAULT_VALUE
)
else:
scale_var_shape = 1
scale_var_type = var_node.type()
init_scale_value = np.array(
[_SCALE_DEFAULT_VALUE], dtype=data_type
)
if (
self._scale_dict is not None
and var_node.name() in self._scale_dict.keys()
):
init_scale_value = np.array(
[self._scale_dict[var_node.name()]], dtype=data_type
)
scale_var_node = graph.create_persistable_node(
name=scale_name,
var_type=scale_var_type,
shape=[scale_var_shape],
var_dtype=var_node.dtype(),
)
_init_var_node(
scale_var_node, init_scale_value, self._scope, self._place
)
zero_point_node = None
if zero_point_node is None:
zero_point_node = graph.create_persistable_node(
name=self._zero_point_name(quant_var_node.name()),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=scale_var_node.shape(),
var_dtype=core.VarDesc.VarType.INT32,
)
_init_var_node(
zero_point_node,
np.zeros(scale_var_node.shape(), dtype="int32"),
self._scope,
self._place,
)
inputs = {"X": var_node, "Scale": scale_var_node}
if zero_point_node is not None:
inputs["ZeroPoint"] = zero_point_node
attrs = {"quant_axis": self.quant_axis, "bit_length": self.quant_bits}
attrs["op_role"] = op_role
outputs = {"Y": quant_var_node}
if not self._is_test:
scale_out_node = graph.create_var_node_from_desc(
scale_var_node.var()
)
state_in_node = graph.create_persistable_node(
name=unique_name.generate('state'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
if var_node.dtype() == paddle.float64:
data_type = 'float64'
elif var_node.dtype() == paddle.float32:
data_type = 'float32'
else:
data_type = "float16"
_init_var_node(
state_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
accum_in_node = graph.create_persistable_node(
name=unique_name.generate('accum'),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
var_dtype=var_node.dtype(),
shape=[1],
)
_init_var_node(
accum_in_node,
np.ones([1], dtype=data_type),
self._scope,
self._place,
)
state_out_node = graph.create_var_node_from_desc(
state_in_node.var()
)
accum_out_node = graph.create_var_node_from_desc(
accum_in_node.var()
)
outputs["OutScale"] = scale_out_node
inputs['InState'] = state_in_node
inputs['InAccum'] = accum_in_node
outputs['OutState'] = state_out_node
outputs['OutAccum'] = accum_out_node
attrs["is_test"] = self._is_test
attrs['moving_rate'] = self._moving_rate
quant_op_node = graph.create_op_node(
op_type="quantize_linear",
attrs=attrs,
inputs=inputs,
outputs=outputs,
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_var_node, quant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
if not self._is_test:
graph.link_to(state_in_node, quant_op_node)
graph.link_to(accum_in_node, quant_op_node)
graph.link_to(quant_op_node, state_out_node)
graph.link_to(quant_op_node, accum_out_node)
graph.link_to(quant_op_node, scale_out_node)
return quant_var_node, scale_var_node
def insert_dequant_op(self, graph, var_node, scale_var_node, op_role):
assert var_node.is_var(), f'{var_node.name()} is not a var'
dequant_var_node = graph.create_var_node(
name=self._dequantized_var_name(var_node.name()),
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
zero_point_node = None
if zero_point_node is None:
zero_point_node = graph.create_persistable_node(
name=self._zero_point_name(dequant_var_node.name()),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=scale_var_node.shape(),
var_dtype=core.VarDesc.VarType.INT32,
)
_init_var_node(
zero_point_node,
np.zeros(scale_var_node.shape(), dtype="int32"),
self._scope,
self._place,
)
inputs = {"X": var_node, "Scale": scale_var_node}
if zero_point_node is not None:
inputs["ZeroPoint"] = zero_point_node
attrs = {"quant_axis": self.quant_axis, "bit_length": self.quant_bits}
attrs["op_role"] = op_role
quant_op_node = graph.create_op_node(
op_type="dequantize_linear",
attrs=attrs,
inputs=inputs,
outputs={"Y": dequant_var_node},
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_var_node, quant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, quant_op_node)
graph.link_to(quant_op_node, dequant_var_node)
return dequant_var_node
def _quantized_var_name(self, var_name):
"""
Return quantized variable name for the input `var_name`.
"""
return f"{var_name}.quantized"
def _dequantized_var_name(self, var_name):
"""
Return dequantized variable name for the input `var_name`.
"""
return f"{var_name}.dequantized"
def _quantized_scale_name(self, var_name):
"""
Return the scale name of quantized variable for the input `var_name`.
"""
return f"{var_name}@scale"
def _zero_point_name(self, var_name):
"""
        Return the zero point name for the var named `var_name`.
"""
return f"{var_name}@zero_point"
class QuantizationTransformPassV2(QuantizationTransformPass):
"""
Quantize the ops that have weights. Add quant and dequant ops for
    the quantized ops' inputs. It is used in the new format of quantization.
"""
def __init__(
self,
scope=None,
place=None,
weight_bits=8,
activation_bits=8,
activation_quantize_type='abs_max',
weight_quantize_type='abs_max',
window_size=10000,
moving_rate=0.9,
skip_pattern=['skip_quant'],
quantizable_op_type=['conv2d', 'depthwise_conv2d', 'mul'],
weight_quantize_func=None,
act_quantize_func=None,
weight_preprocess_func=None,
act_preprocess_func=None,
optimizer_func=None,
executor=None,
is_test=None,
):
r"""
Args:
            scope(paddle.Scope): When the activation uses 'range_abs_max' as the
                quantize type, this pass will create some new parameters. The scope
                is used to initialize these new parameters.
            place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to initialize new
                parameters described above. If it's a string, it can be ``cpu`` or
                ``gpu:x``, where ``x`` is the GPU index.
weight_bits(int): quantization bit number for weights,
the bias is not quantized.
activation_bits(int): quantization bit number for activation.
activation_quantize_type(str): quantization type for activation,
                now supporting 'abs_max', 'range_abs_max' and 'moving_average_abs_max'.
                With 'abs_max', the quantization scale is calculated dynamically at
                each step during both training and testing. With 'range_abs_max', a
                static quantization scale is calculated during training and used
                for inference.
weight_quantize_type(str): quantization type for weights,
                support 'abs_max' and 'channel_wise_abs_max'. 'range_abs_max' is
                usually not used for weights, since weights are fixed once the
                model is well trained.
window_size(int): the window size for 'range_abs_max' quantization.
moving_rate(float): the param for 'moving_average_abs_max' quantization.
skip_pattern(str or str list): The user-defined quantization skip pattern, which
will be presented in the name scope of an op. When the skip pattern is
detected in an op's name scope, the corresponding op will not be quantized.
quantizable_op_type(list[str]): List the type of ops that will be quantized.
Default is ["conv2d", "depthwise_conv2d", "mul"]. The quantizable_op_type in
QuantizationFreezePass and ConvertToInt8Pass must be the same as this.
            weight_quantize_func(function): Function that defines how to quantize the weight.
                Use this to quickly test whether a user-defined quantization method works.
                The function should implement both quantization and dequantization: its
                input is the non-quantized weight and it returns the dequantized weight.
                If None, the quantization op defined by 'weight_quantize_type' is used.
                Default is None.
            act_quantize_func(function): Function that defines how to quantize the activation.
                Use this to quickly test whether a user-defined quantization method works.
                The function should implement both quantization and dequantization: its
                input is the non-quantized activation and it returns the dequantized
                activation. If None, the quantization op defined by
                'activation_quantize_type' is used. Default is None.
            weight_preprocess_func(function): Function that defines how to preprocess the
                weight before quantization. Use this to quickly test whether a user-defined
                preprocess method works. Its input is the non-quantized weight and it
                returns the processed weight to be quantized. If None, the weight is
                quantized directly. Default is None.
            act_preprocess_func(function): Function that defines how to preprocess the
                activation before quantization. Use this to quickly test whether a
                user-defined preprocess method works. Its input is the non-quantized
                activation and it returns the processed activation to be quantized.
                If None, the activation is quantized directly. Default is None.
            optimizer_func(function): Function that returns an optimizer. When 'is_test'
                is False and a user-defined quantization function and preprocess function
                are used, this function must be set. Default is None.
            executor(paddle.Executor): If a user-defined quantization function and
                preprocess function are used, an executor must be set for initialization.
                Default is None.
Examples:
.. code-block:: pycon
                >>> # The original graph will be rewritten.
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import QuantizationTransformPassV2
>>> from paddle.base.framework import IrGraph
>>> from paddle.framework import core
>>> graph = IrGraph(core.Graph(static.Program().desc), for_test=False)
>>> place = paddle.CPUPlace()
>>> scope = paddle.static.global_scope()
>>> transform_pass = QuantizationTransformPassV2(scope, place)
>>> transform_pass.apply(graph)
"""
self._scope = scope
self._place = _get_paddle_place(place)
self._weight_bits = weight_bits
self._activation_bits = activation_bits
self._skip_pattern = skip_pattern
self._weight_quantize_func = weight_quantize_func
self._act_quantize_func = act_quantize_func
self._weight_preprocess_func = weight_preprocess_func
self._act_preprocess_func = act_preprocess_func
self._optimizer = optimizer_func
self._exe = executor
self._conv1dtranspose_flag = False
quant_type = [
'abs_max',
'channel_wise_abs_max',
'range_abs_max',
'moving_average_abs_max',
]
assert activation_quantize_type != 'channel_wise_abs_max', (
"The activation quantization type does not support 'channel_wise_abs_max'."
)
if activation_quantize_type not in quant_type:
raise ValueError(
f"Unknown activation_quantize_type : '{activation_quantize_type}'. It can only be "
"'abs_max' or 'range_abs_max' or 'moving_average_abs_max'."
)
if weight_quantize_type not in quant_type:
raise ValueError(
f"Unknown weight_quantize_type: '{weight_quantize_type}'. It can only be "
"'abs_max' or 'channel_wise_abs_max' or 'range_abs_max' "
"or 'moving_average_abs_max'."
)
self._activation_quantize_type = activation_quantize_type
self._weight_quantize_type = weight_quantize_type
self._window_size = window_size
self._moving_rate = moving_rate
self._quantizable_ops = quantizable_op_type
for op in self._quantizable_ops:
assert op in list(SUPPORT_WEIGHT_QUANTIZATION_OP_DICT.keys()), (
op + " is not supported for quantization."
)
self._quantizable_grad_ops = [
f'{op}_grad' for op in self._quantizable_ops
]
self._is_test = is_test
self._global_step = None
self.create_var_map = {}
self.create_op_map = {}
def _quant_preprocess(self, op_node):
user_skipped = False
if isinstance(self._skip_pattern, list):
user_skipped = op_node.op().has_attr("op_namescope") and any(
pattern in op_node.op().attr("op_namescope")
for pattern in self._skip_pattern
)
elif isinstance(self._skip_pattern, str):
user_skipped = (
op_node.op().has_attr("op_namescope")
and op_node.op().attr("op_namescope").find(self._skip_pattern)
!= -1
)
if user_skipped:
op_node.op()._set_attr("skip_quant", True)
op_node.op()._set_attr("with_quant_attr", True)
def _transform_forward(self, graph, op):
op.op()._set_attr("quantization_type", "qat_with_weight")
op_role = op.op().attr("op_role")
weight_scale_node = None
inputs = op.inputs
for var_node in inputs:
if var_node.name() not in op.input_arg_names():
continue
if var_node.name() in self.dequantized_vars:
dequant_var_node = self.dequantized_vars[var_node.name()]
else:
name = var_node.name()
if name in self.processed_vars:
continue
is_weight = (
True
if var_node.name() in self.persistable_vars
or var_node.name() in self.persistable_cast_output_vars
else False
)
                # If the var node is a weight and weight_preprocess_func is not
                # None, insert the weight preprocess func to preprocess the weight
                # before quantization. If the var node is an activation and
                # act_preprocess_func is not None, insert the activation preprocess
                # func to preprocess the activation before quantization.
if is_weight and self._weight_preprocess_func is not None:
var_node = self._insert_func(
graph, self._weight_preprocess_func, var_node, op
)
elif not is_weight and self._act_preprocess_func is not None:
var_node = self._insert_func(
graph, self._act_preprocess_func, var_node, op
)
                # If the var node is a weight and weight_quantize_func is not None,
                # insert the weight quantize func to quantize and dequantize the
                # weight. If the var node is an activation and act_quantize_func is
                # not None, insert the act quantize func to quantize and dequantize
                # the activation.
if is_weight and self._weight_quantize_func is not None:
target_out_node = self._insert_func(
graph, self._weight_quantize_func, var_node, op
)
self.processed_vars.append(name)
continue
elif not is_weight and self._act_quantize_func is not None:
target_out_node = self._insert_func(
graph, self._act_quantize_func, var_node, op
)
self.processed_vars.append(name)
continue
quant_bits = (
self._weight_bits
if var_node.name() in self.persistable_vars
else self._activation_bits
)
quant_type = (
self._weight_quantize_type
if is_weight
else self._activation_quantize_type
)
quant_axis = -1
channel_wise = False
if quant_type == 'channel_wise_abs_max': # Weight quantization
channel_wise = True
op_type = op.name()
trans_y = (op_type == 'matmul_v2') and op.op().attr(
'trans_y'
)
op_type = op_type + '_trans_y' if trans_y else op_type
if self._conv1dtranspose_flag:
quant_axis = 1
self._conv1dtranspose_flag = False
else:
quant_axis = (
1
if op.name() in utils._channelwise_quant_axis1_ops
else 0
)
insert_quant_pass = InsertQuantizeLinear(
self._place,
self._scope,
quant_bits=quant_bits,
quant_axis=quant_axis,
channel_wise=channel_wise,
moving_rate=self._moving_rate,
is_test=self._is_test,
)
(
quant_var_node,
scale_var_node,
) = insert_quant_pass.insert_quant_op(
graph, var_node, var_name=name, op_role=op_role
)
dequant_var_node = insert_quant_pass.insert_dequant_op(
graph, quant_var_node, scale_var_node, op_role
)
self.dequantized_vars[name] = dequant_var_node
if is_weight:
weight_scale_node = scale_var_node
graph.update_input_link(var_node, dequant_var_node, op)
return weight_scale_node
def _transform_backward(self, graph, op):
for var_node in op.inputs:
if var_node.name() not in op.input_arg_names():
continue
if var_node.name() in self.dequantized_vars:
dequant_var_node = self.dequantized_vars[var_node.name()]
graph.update_input_link(var_node, dequant_var_node, op)
def _has_weight(self, op):
has_weight = False
for var_node in op.inputs:
if var_node.name() not in op.input_arg_names():
continue
if (
var_node.name() in self.persistable_vars
or var_node.name() in self.persistable_cast_output_vars
):
has_weight = True
return has_weight
def _quant_conv1d(self, graph, op):
# conv1d in inference is a combination of unsqueeze2 + conv2d
if ("conv2d" not in op.name()) or (
"unsqueeze2" not in op.input("Filter")[0]
):
return
conv_weight_var_name = op.input("Filter")[0]
# unsqueeze2 and conv2d will share weight scale
weight_scale_node = None
# quant unsqueeze2
for _op in graph.all_op_nodes():
var_names = utils._get_op_output_var_names(_op)
if conv_weight_var_name in var_names and self._has_weight(_op):
if op.name() == 'conv2d_transpose':
if not self._is_skip_quant(graph, _op):
weight_scale_node = self._transform_forward(graph, _op)
else:
weight_scale_node = self._transform_forward(graph, _op)
# insert qdq before conv2d
for var_node in op.inputs:
quant_bits = (
self._weight_bits
if var_node.name() == conv_weight_var_name
else self._activation_bits
)
quant_type = (
self._weight_quantize_type
if var_node.name() == conv_weight_var_name
else self._activation_quantize_type
)
quant_axis = -1
channel_wise = False
if quant_type == 'channel_wise_abs_max':
channel_wise = True
quant_axis = (
1 if op.name() in utils._channelwise_quant_axis1_ops else 0
)
if 'unsqueeze2' in utils._channelwise_quant_axis1_ops:
utils._channelwise_quant_axis1_ops.remove('unsqueeze2')
if self._is_skip_quant(graph, op):
return
insert_quant_pass = InsertQuantizeLinear(
self._place,
self._scope,
quant_bits=quant_bits,
quant_axis=quant_axis,
channel_wise=channel_wise,
moving_rate=self._moving_rate,
is_test=self._is_test,
)
scale_var_node = (
weight_scale_node
if var_node.name() == conv_weight_var_name
else None
)
(
quant_var_node,
scale_var_node,
) = insert_quant_pass.insert_quant_op(
graph,
var_node,
var_name=var_node.name(),
scale_var_node=scale_var_node,
op_role=op.op().attr("op_role"),
)
dequant_var_node = insert_quant_pass.insert_dequant_op(
graph,
quant_var_node,
scale_var_node,
op.op().attr("op_role"),
)
graph.update_input_link(var_node, dequant_var_node, op)
def apply(self, graph):
"""
        Quantize the graph for the training process. According to the weight and
        activation quantization types, fake quantize and fake dequantize
        operators will be added to the graph.
Args:
graph(IrGraph): the applied graph.
Returns:
None
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
if self._is_test is None:
self._is_test = graph.is_test()
        # Mark the variables that have been dequantized.
self.dequantized_vars = collections.OrderedDict()
self.persistable_vars = []
self.processed_vars = []
self.persistable_vars = [
p.name() for p in graph.all_persistable_nodes()
]
ops = graph.all_op_nodes()
        # Mark the outputs of cast ops whose input is a weight (for AMP programs).
self.persistable_cast_output_vars = []
for op in graph.all_op_nodes():
if (
op.name() == "cast"
and op.inputs[0].name() in self.persistable_vars
):
self.persistable_cast_output_vars.append(op.outputs[0].name())
        # Do the quantization preprocessing, such as marking ops that should be
        # skipped and not quantized.
for op in ops:
if (
op.name() in self._quantizable_ops
or op.name() in self._quantizable_grad_ops
):
self._quant_preprocess(op)
        # Insert a mapping table to solve the problem of saving the inference model.
        graph.out_node_mapping_table = {}
        # _transform_forward and _transform_backward must run in two separate
        # loops. First, the loop that transforms the forward graph:
with tqdm(
total=len(ops),
bar_format='Adding quant op with weight:|{bar}| {n_fmt}/{total_fmt}',
ncols=80,
) as t:
for op in ops:
if op.name() in self._quantizable_ops:
if not self._is_skip_quant(graph, op) and self._has_weight(
op
):
self._transform_forward(graph, op)
                    else:  # op is skipped or has no weight input
# support conv1d quantization
self._quant_conv1d(graph, op)
t.update()
# The loop for renaming the inputs of backward op.
for op in ops:
if op.name() in self._quantizable_grad_ops and self._has_weight(op):
self._transform_backward(graph, op)
return graph
class AddQuantDequantPassV2:
"""
    Quantize the ops that do not have weights, and add quantize_linear and
    dequantize_linear ops for the quantized ops' inputs. It is used in the new
    format of quantization.
"""
# To be compatible with PaddleSlim, not remove _activation_type for now
_activation_type = ["relu", "relu6", "leaky_relu", "tanh", "swish"]
def __init__(
self,
scope=None,
place=None,
moving_rate=0.9,
quant_bits=8,
skip_pattern=["skip_quant"],
quantizable_op_type=["elementwise_add", "pool2d"],
is_test=None,
scale_dict=None,
):
"""
Args:
scope(paddle.Scope): The scope is used to initialize these new parameters.
place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to initialize new
                parameters described above. If ``place`` is a string, it can be
                ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
moving_rate(float, optional): the param for 'quant_dequant_moving_average_abs_max'
quantization. Default is 0.9.
quant_bits(int, optional): quantization bit number for activation. Default is 8.
skip_pattern(str, optional): The user-defined quantization skip pattern, which
will be presented in the name scope of an op. When the skip pattern is
detected in an op's name scope, the corresponding op will not be quantized.
Default is 'skip_quant'.
quantizable_op_type(list[str], optional): List the type of ops that will be
quantized. Default is ["elementwise_add", "pool2d"].
            scale_dict(dict, optional): calibration ranges of output tensors.
Examples:
.. code-block:: pycon
                >>> # The original graph will be rewritten.
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import AddQuantDequantPassV2
>>> from paddle.base.framework import IrGraph
>>> from paddle.framework import core
>>> graph = IrGraph(core.Graph(static.Program().desc), for_test=False)
>>> place = paddle.CPUPlace()
>>> scope = paddle.static.global_scope()
>>> add_quant_dequant_pass = AddQuantDequantPassV2(scope, place)
>>> add_quant_dequant_pass.apply(graph)
"""
self._scope = scope
self._place = _get_paddle_place(place)
self._moving_rate = moving_rate
self._quant_bits = quant_bits
self._is_test = is_test
self._skip_pattern = skip_pattern
self._scale_dict = scale_dict
self._quantizable_op_type = quantizable_op_type
for op_type in self._quantizable_op_type:
assert op_type in list(SUPPORT_ACT_QUANTIZATION_OP_DICT.keys()), (
op_type + " is not supported for quantization."
)
self._quantizable_grad_op_type = [
f'{op}_grad' for op in self._quantizable_op_type
]
assert self._scope is not None, "scope must not be None."
assert self._place is not None, "place must not be None."
self.persistable_vars = []
def apply(self, graph):
"""
Add quant_dequant before some ops, such as the 'elementwise_add' and
'pool2d' op.
Args:
graph(IrGraph): the target graph.
Returns:
None
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
if self._is_test is None:
self._is_test = graph.is_test()
dequantized_vars_map = collections.OrderedDict()
self.persistable_vars = [
p.name() for p in graph.all_persistable_nodes()
]
# Forward stage, insert quant_dequant op
all_op_nodes = graph.all_op_nodes()
with tqdm(
total=len(all_op_nodes),
bar_format='Adding quant activation op:|{bar}| {n_fmt}/{total_fmt}',
ncols=80,
) as t:
for op_node in all_op_nodes:
if op_node.name() in self._quantizable_op_type:
is_skip = False
if isinstance(self._skip_pattern, list):
is_skip = op_node.op().has_attr("op_namescope") and any(
pattern in op_node.op().attr("op_namescope")
for pattern in self._skip_pattern
)
elif isinstance(self._skip_pattern, str):
is_skip = (
op_node.op().has_attr("op_namescope")
and op_node.op()
.attr("op_namescope")
.find(self._skip_pattern)
!= -1
)
is_quantized = (
op_node.op().has_attr("quantization_type")
and op_node.op().attr("quantization_type")
== "qat_with_weight"
)
if is_skip or is_quantized:
continue
arg_names = utils._get_op_input_var_names(op_node)
                    # If already quantized, skip it.
skip_quant = False
for arg_name in arg_names:
if "quantized.dequantized" in arg_name:
skip_quant = True
break
if skip_quant:
continue
for arg_name in arg_names:
in_node = graph._find_node_by_name(
op_node.inputs, arg_name
)
if in_node.persistable():
continue
if in_node.dtype() not in [
paddle.float64,
paddle.float32,
paddle.float16,
]:
_logger.warning(
f"Since the {op_node.name()} contains an input of type INT, the quantization of this layer is skipped."
)
break
if arg_name in dequantized_vars_map:
dequant_var_node = dequantized_vars_map[arg_name]
else:
insert_quant_pass = InsertQuantizeLinear(
self._place,
self._scope,
quant_bits=self._quant_bits,
quant_axis=-1,
channel_wise=False,
moving_rate=self._moving_rate,
is_test=self._is_test,
scale_dict=self._scale_dict,
)
(
quant_var_node,
scale_var_node,
) = insert_quant_pass.insert_quant_op(
graph,
in_node,
op_role=op_node.op().attr("op_role"),
)
dequant_var_node = (
insert_quant_pass.insert_dequant_op(
graph,
quant_var_node,
scale_var_node,
op_node.op().attr("op_role"),
)
)
dequantized_vars_map[arg_name] = dequant_var_node
graph.update_input_link(
in_node, dequant_var_node, op_node
)
t.update()
# Backward stage, update input link
for op_node in all_op_nodes:
if op_node.name() in self._quantizable_grad_op_type:
for input_name in op_node.input_arg_names():
if input_name in dequantized_vars_map:
in_node = graph._find_node_by_name(
op_node.inputs, input_name
)
dequant_var_node = dequantized_vars_map[input_name]
graph.update_input_link(
in_node, dequant_var_node, op_node
)
return graph
class ReplaceFakeQuantDequantPass:
"""
    Replace fake quant-dequant ops with quantize_linear and dequantize_linear ops.
"""
def __init__(self, scope, place, quant_bits=8):
r"""
Args:
scope(paddle.Scope): The scope is used to initialize these new parameters.
place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to initialize new
            parameters described above. If ``place`` is a string, it can be ``cpu``
            or ``gpu:x``, where ``x`` is the GPU index.
quant_bits(int, optional): quantization bit number for activation. Default is 8.
Examples:
.. code-block:: pycon
            >>> # The original graph will be rewritten.
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import ReplaceFakeQuantDequantPass
>>> from paddle.base.framework import IrGraph
>>> from paddle.framework import core
>>> graph = IrGraph(core.Graph(static.Program().desc), for_test=False)
>>> place = paddle.CPUPlace()
>>> scope = paddle.static.global_scope()
>>> replace_pass = ReplaceFakeQuantDequantPass(scope, place)
>>> replace_pass.apply(graph)
"""
self._place = _get_paddle_place(place)
self._scope = scope
self._quant_bits = quant_bits
assert self._scope is not None, "scope must not be None."
assert self._place is not None, "place must not be None."
def apply(self, graph):
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
fake_quant_dequant_ops = []
remove_fake_quant_ops = []
observer_out_node_names = []
for op in graph.all_op_nodes():
# collect observer node
if op.name() == "moving_average_abs_max_scale":
observer_out_node_names.append(op.output("Out")[0])
for op in graph.all_op_nodes():
if (
op.name() in _fake_quant_dequant_op_list
or op.name() == "moving_average_abs_max_scale"
):
var_name = op.input("X")[0]
if var_name in observer_out_node_names:
remove_fake_quant_ops.append(op)
else:
fake_quant_dequant_ops.append(op)
for _op in remove_fake_quant_ops:
x_node = graph._find_node_by_name(_op.inputs, _op.input("X")[0])
out_node = graph._find_node_by_name(
_op.outputs, _op.output("Out")[0]
)
for next_op_node in out_node.outputs:
graph.update_input_link(out_node, x_node, next_op_node)
for _op in fake_quant_dequant_ops:
self._replace_op(graph, _op)
graph.safe_remove_nodes(_op)
graph.resolve_hazard()
return graph
def _replace_op(self, graph, op):
x_node = graph._find_node_by_name(op.inputs, op.input("X")[0])
out_node = graph._find_node_by_name(op.outputs, op.output("Out")[0])
scale_node = graph._find_node_by_name(
op.outputs, op.output("OutScale")[0]
)
quant_axis = (
op.op().attr("quant_axis") if op.op().has_attr("quant_axis") else -1
)
bit_length = (
op.op().attr("bit_length")
if op.op().has_attr("bit_length")
else self._quant_bits
)
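        # Symmetric signed range, e.g. bit_length = 8 gives qmax = 127, qmin = -128.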
qmax = (1 << (bit_length - 1)) - 1
qmin = -1 * qmax - 1
zero_point_node = None
quanted_node = x_node
if zero_point_node is None:
zero_point_node = graph.create_persistable_node(
name=self._zero_point_name(quanted_node.name()),
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=scale_node.shape(),
var_dtype=core.VarDesc.VarType.INT32,
)
_init_var_node(
zero_point_node,
np.zeros(scale_node.shape(), dtype="int32"),
self._scope,
self._place,
)
quant_var_node = graph.create_var_node(
name=self._quantized_var_name(x_node.name()),
var_type=x_node.type(),
shape=x_node.shape(),
var_dtype=x_node.dtype(),
)
quant_op_node = graph.create_op_node(
op_type="quantize_linear",
attrs={
"quant_axis": quant_axis,
"bit_length": bit_length,
"qmin": qmin,
"qmax": qmax,
},
inputs={
"X": x_node,
"Scale": scale_node,
"ZeroPoint": zero_point_node,
},
outputs={"Y": quant_var_node},
)
graph.link_to(x_node, quant_op_node)
graph.link_to(scale_node, quant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
dequant_op_node = graph.create_op_node(
op_type="dequantize_linear",
attrs={
"quant_axis": quant_axis,
"bit_length": bit_length,
"qmin": qmin,
"qmax": qmax,
},
inputs={
"X": quant_var_node,
"Scale": scale_node,
"ZeroPoint": zero_point_node,
},
outputs={"Y": out_node},
)
graph.link_to(quant_var_node, dequant_op_node)
graph.link_to(scale_node, dequant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, dequant_op_node)
graph.link_to(dequant_op_node, out_node)
def _quantized_var_name(self, var_name):
"""
Return quantized variable name for the input `var_name`.
"""
return f"{var_name}.quantized"
def _zero_point_name(self, var_name):
"""
        Return the zero point name for the var named `var_name`.
"""
return f"{var_name}@zero_point"
class QuantWeightPass:
"""
    Quantize weights and remove the weight input's quantize_linear node. For example:
    `weight -> quant -> dequant -> conv2d` will be frozen into `weight -> dequant -> conv2d`,
    and the weight will be scaled offline.
Args:
scope(paddle.Scope): scope is used to get the weight tensor values.
        place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to restore the weight tensors.
            If it's a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
        bias_correction(bool): whether to use bias correction for post-training quantization.
            https://arxiv.org/abs/1810.05723.
        quant_bits(int, optional): quantization bit number for weight. Default is 8.
        save_int_weight(bool, optional): Whether to save the weight as an integer type.
            Default is True.
Examples:
.. code-block:: pycon
            >>> # The original graph will be rewritten.
>>> import paddle
>>> import paddle.static as static
>>> from paddle.static.quantization import QuantWeightPass
>>> from paddle.base.framework import IrGraph
>>> from paddle.framework import core
>>> graph = IrGraph(core.Graph(paddle.static.Program().desc), for_test=False)
>>> place = paddle.CPUPlace()
>>> scope = paddle.static.global_scope()
>>> quant_weight_pass = QuantWeightPass(scope, place)
>>> quant_weight_pass.apply(graph)
"""
def __init__(
self,
scope,
place,
bias_correction=False,
quant_bits=8,
save_int_weight=True,
):
self._place = _get_paddle_place(place)
self._scope = scope
self._bias_correction = bias_correction
self._quant_bits = quant_bits
self._save_int_weight = save_int_weight
assert self._scope is not None, "scope must not be None."
assert self._place is not None, "place must not be None."
self._quantized_ops = set()
def apply(self, graph):
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
fake_quant_ops_for_weight = []
fake_quant_ops = [
op for op in graph.all_op_nodes() if op.name() == "quantize_linear"
]
for _op in fake_quant_ops:
x_node = graph._find_node_by_name(_op.inputs, _op.input("X")[0])
if x_node.persistable():
scale_node = graph._find_node_by_name(
_op.inputs, _op.input("Scale")[0]
)
zero_point_node = graph._find_node_by_name(
_op.inputs, _op.input("ZeroPoint")[0]
)
out_node = graph._find_node_by_name(
_op.outputs, _op.output("Y")[0]
)
scale_v = self._load_var(scale_node.name())
assert scale_v.ndim in [
1,
2,
], "the dim of scale_v should be 1 or 2"
if scale_v.ndim == 2:
scale_v = scale_v[0]
if scale_v.size == 1 and _op.name() == 'abs_max':
scale_v = scale_v[0]
else:
scale_v = scale_v.tolist()
param_v = self._load_var(x_node.name())
quant_axis = _op.op().attr("quant_axis")
bits_length = _op.op().attr("bit_length")
if x_node.name() not in self._quantized_ops:
self._quantized_ops.add(x_node.name())
quantized_param_v = utils.quant_tensor(
param_v.copy(),
scale_v,
quant_axis,
bits_length,
onnx_format=True,
)
if self._bias_correction is True:
quantized_param_v = utils.bias_correction_w(
param_v,
quantized_param_v,
scale_v,
quant_axis,
weight_bits=bits_length,
)
if self._save_int_weight:
# cast weight type to int
if self._quant_bits == 8:
save_weight_dtype = np.int8
quantized_param_v = quantized_param_v.astype(
save_weight_dtype
)
self._restore_var(x_node.name(), quantized_param_v)
for next_op_node in out_node.outputs:
graph.update_input_link(out_node, x_node, next_op_node)
graph.safe_remove_nodes(_op)
self._remove_unused_var_nodes(graph)
def _remove_unused_var_nodes(self, graph):
all_used_vars = set()
ops = graph.all_op_nodes()
for op_node in ops:
for input_node in op_node.inputs:
all_used_vars.add(input_node)
for output_node in op_node.outputs:
all_used_vars.add(output_node)
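        # Compare the underlying core nodes rather than the IrNode wrappers.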
all_used_vars = {n.node for n in all_used_vars}
all_unused_vars = set(
filter(
lambda node: node.node not in all_used_vars,
graph.all_var_nodes(),
)
)
graph.safe_remove_nodes(all_unused_vars)
def _load_var(self, name):
return np.array(self._scope.find_var(name).get_tensor())
def _restore_var(self, name, array):
tensor = self._scope.find_var(name).get_tensor()
tensor.set(array, self._place)
class AddQuantDequantForInferencePass:
"""
    When exporting the quantized model, this pass traverses the graph to find the
    output of each op and then inserts quant/dequant ops after it.
"""
def __init__(
self,
scope,
place,
quant_bits=8,
quantizable_op_type=[],
calibration_range_dict=None,
only_observer=True,
):
"""
Args:
scope(static.Scope): The scope is used to initialize these new parameters.
place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to restore the weight tensors.
                If it's a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
            quant_bits(int, optional): quantization bit number for the inserted
                quant/dequant ops. Default is 8.
            quantizable_op_type(list[str], optional): ops to process; if empty, all
                supported quantization op types are used. Default is [].
            calibration_range_dict(dict, optional): calibration ranges of output
                tensors, used when a scale cannot be found in the scope.
            only_observer(bool, optional): Whether the inserted quant/dequant ops only
                observe; stored as an op attribute. Default is True.
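        Examples:
            .. code-block:: pycon
                >>> # A minimal sketch (the import path is an assumption); quant/
                >>> # dequant nodes are inserted after each quantizable op output.
                >>> import paddle
                >>> from paddle.static.quantization import AddQuantDequantForInferencePass
                >>> from paddle.base.framework import IrGraph
                >>> from paddle.framework import core
                >>> graph = IrGraph(core.Graph(paddle.static.Program().desc), for_test=True)
                >>> place = paddle.CPUPlace()
                >>> scope = paddle.static.global_scope()
                >>> export_pass = AddQuantDequantForInferencePass(scope, place)
                >>> export_pass.apply(graph)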
"""
self._scope = scope
self._place = place
self._quant_bits = quant_bits
self._only_observer = only_observer
self._teller_set = (
quantizable_op_type
if quantizable_op_type
else list(SUPPORT_QUANTIZATION_OP_DICT.keys())
)
self._calibration_range_dict = calibration_range_dict
def apply(self, graph):
"""
Args:
graph(IrGraph): the target graph.
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
dequant_node_map = {}
dequantized_vars_map = collections.OrderedDict()
for op_node in graph.all_op_nodes():
if op_node.name() in self._teller_set:
var_names = utils._get_op_output_var_names(op_node)
for var_name in var_names:
out_node = graph._find_node_by_name(
op_node.outputs, var_name
)
if out_node.dtype() not in [
paddle.float64,
paddle.float32,
paddle.float16,
]:
continue
if var_name in dequantized_vars_map:
dequant_var_node = dequantized_vars_map[var_name]
else:
dequant_var_node = self._insert_quant_dequant_op(
graph, out_node
)
dequantized_vars_map[var_name] = dequant_var_node
dequant_node_map[var_name] = dequant_var_node
        # Remove unused observer nodes and link the act quant/dequant output to
        # each consumer op node.
for op_node in graph.all_op_nodes():
if op_node.name() == 'moving_average_abs_max_scale':
graph.safe_remove_nodes(op_node)
else:
var_names = utils._get_op_input_var_names(op_node)
for var_name in var_names:
if dequant_node_map.get(var_name):
in_node = graph._find_node_by_name(
op_node.inputs, var_name
)
graph.update_input_link(
in_node, dequant_node_map[var_name], op_node
)
return graph
def _scale_name(self, var_name):
"""
Return the scale name for the var named `var_name`.
"""
return f"{var_name}@scale"
def _insert_quant_dequant_op(self, graph, var_node):
assert var_node.is_var(), f'{var_node.name()} is not a var'
var_name = var_node.name()
quant_axis = -1
quant_var_node = graph.create_var_node(
name=f"{var_name}.quantized",
var_type=var_node.type(),
shape=var_node.shape(),
var_dtype=var_node.dtype(),
)
try:
scale_var_node = graph._find_node_by_name(
graph.all_persistable_nodes(), self._scale_name(var_name)
)
except:
if (
self._calibration_range_dict
and var_name in self._calibration_range_dict
):
scale_value = self._calibration_range_dict[var_name]
scale_var_node = graph.create_persistable_node(
name=self._scale_name(var_name),
var_type=var_node.type(),
shape=[1],
var_dtype=var_node.dtype(),
)
data_type = (
'float64'
if var_node.dtype() == paddle.float64
else 'float32'
)
_init_var_node(
scale_var_node,
np.array(scale_value, dtype=data_type),
self._scope,
self._place,
)
else:
_logger.warning(
f"Cannot find the target node {var_name} in scope, so skip adding quant node."
)
return None
try:
zero_point_node = graph._find_node_by_name(
graph.all_persistable_nodes(),
f"{quant_var_node.name()}@zero_point",
)
except:
zero_point_node = graph.create_persistable_node(
name=f"{quant_var_node.name()}@zero_point",
var_type=core.VarDesc.VarType.DENSE_TENSOR,
shape=scale_var_node.shape(),
var_dtype=core.VarDesc.VarType.INT32,
)
_init_var_node(
zero_point_node,
np.zeros(scale_var_node.shape(), dtype="int32"),
self._scope,
self._place,
)
inputs = {"X": var_node, "Scale": scale_var_node}
if zero_point_node is not None:
inputs["ZeroPoint"] = zero_point_node
attrs = {
"quant_axis": quant_axis,
"bit_length": self._quant_bits,
"only_observer": self._only_observer,
}
attrs["op_role"] = core.op_proto_and_checker_maker.OpRole.Forward
outputs = {"Y": quant_var_node}
quant_op_node = graph.create_op_node(
op_type="quantize_linear",
attrs=attrs,
inputs=inputs,
outputs=outputs,
)
graph.link_to(var_node, quant_op_node)
graph.link_to(scale_var_node, quant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, quant_op_node)
graph.link_to(quant_op_node, quant_var_node)
# add dequant_linear node
dequant_var_node = graph.create_var_node(
name=f"{quant_var_node.name()}.dequantized",
var_type=quant_var_node.type(),
shape=quant_var_node.shape(),
var_dtype=quant_var_node.dtype(),
)
inputs = {"X": quant_var_node, "Scale": scale_var_node}
if zero_point_node is not None:
inputs["ZeroPoint"] = zero_point_node
attrs = {
"quant_axis": -1,
"bit_length": self._quant_bits,
"only_observer": self._only_observer,
}
attrs["op_role"] = core.op_proto_and_checker_maker.OpRole.Forward
dequant_op_node = graph.create_op_node(
op_type="dequantize_linear",
attrs=attrs,
inputs=inputs,
outputs={"Y": dequant_var_node},
)
graph.link_to(quant_var_node, dequant_op_node)
graph.link_to(scale_var_node, dequant_op_node)
if zero_point_node is not None:
graph.link_to(zero_point_node, dequant_op_node)
graph.link_to(dequant_op_node, dequant_var_node)
return dequant_var_node
class AddQuantDequantForResidual:
"""
Quantize the residual connections. Add quant and dequant ops for the residual inputs.
"""
def __init__(
self,
scope,
place,
quant_bits=8,
is_test=True,
):
"""
Args:
scope(static.Scope): The scope is used to initialize these new parameters.
place(paddle.CPUPlace|paddle.CUDAPlace|str): place is used to restore the weight tensors.
                If it's a string, it can be ``cpu`` or ``gpu:x``, where ``x`` is the GPU index.
            quant_bits(int, optional): quantization bit number for the inserted
                quant/dequant ops. Default is 8.
            is_test(bool, optional): Whether the pass runs in inference (test) mode rather
                than training. Default is True.
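        Examples:
            .. code-block:: pycon
                >>> # A minimal sketch (the import path is an assumption); the
                >>> # skip branch of residual elementwise_add ops is quantized.
                >>> import paddle
                >>> from paddle.static.quantization import AddQuantDequantForResidual
                >>> from paddle.base.framework import IrGraph
                >>> from paddle.framework import core
                >>> graph = IrGraph(core.Graph(paddle.static.Program().desc), for_test=True)
                >>> place = paddle.CPUPlace()
                >>> scope = paddle.static.global_scope()
                >>> residual_pass = AddQuantDequantForResidual(scope, place)
                >>> residual_pass.apply(graph)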
"""
self._place = _get_paddle_place(place)
self._scope = scope
self._quant_bits = quant_bits
self._is_test = is_test
assert self._scope is not None, "scope must not be None."
assert self._place is not None, "place must not be None."
def apply(self, graph):
"""
Args:
graph(IrGraph): the target graph.
"""
assert isinstance(graph, IrGraph), (
'graph must be the instance of IrGraph.'
)
weight_var_names = self._all_weight_node_names(graph)
var_node_names_with_order = self._var_name_order(graph)
for op in graph.all_op_nodes():
if op.name() != 'elementwise_add':
continue
first_input_name = op.inputs[0].name()
second_input_name = op.inputs[1].name()
if (
first_input_name in weight_var_names
or second_input_name in weight_var_names
):
continue
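            # The input produced earlier in topological order is treated as the
            # residual (skip) branch; quant/dequant is inserted on that branch only.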
skip_node = (
op.inputs[0]
if var_node_names_with_order[first_input_name]
< var_node_names_with_order[second_input_name]
else op.inputs[1]
)
self._insert_quant_dequant(graph, skip_node, op)
def _all_weight_node_names(self, graph):
"""
        Return a list of weight variable names (including cast weights).
"""
weight_var_names = [
node.name() for node in graph.all_persistable_nodes()
]
for op in graph.all_op_nodes():
if op.name() == 'cast' and op.inputs[0].persistable():
weight_var_names.append(op.outputs[0].name())
return weight_var_names
def _var_name_order(self, graph):
"""
        Return a dictionary mapping each variable name to the topological order of
        the first op that consumes it.
"""
ordered_ops = graph.topology_sort()
var_node_names_with_order = {}
for idx, op_node in enumerate(ordered_ops):
for in_var_node in op_node.inputs:
in_var_name = in_var_node.name()
if var_node_names_with_order.get(in_var_name) is None:
var_node_names_with_order[in_var_name] = idx
return var_node_names_with_order
def _insert_quant_dequant(self, graph, var_node, op):
"""
        Insert per-tensor quantize_linear and dequantize_linear nodes between var_node and op.
"""
insert_quant_pass = InsertQuantizeLinear(
self._place,
self._scope,
quant_bits=self._quant_bits,
quant_axis=-1,
channel_wise=False,
is_test=self._is_test,
)
quant_var_name = var_node.name() + '.skip'
op_role = op.op().attr("op_role")
(
quant_var_node,
scale_var_node,
) = insert_quant_pass.insert_quant_op(
graph, var_node, var_name=quant_var_name, op_role=op_role
)
dequant_var_node = insert_quant_pass.insert_dequant_op(
graph, quant_var_node, scale_var_node, op_role
)
graph.update_input_link(var_node, dequant_var_node, op)