Blame: python/paddle/tensorrt/export.py - PaddlePaddle/Paddle

PaddlePaddle / Paddle UNCLAIMED

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

0 0 0 C++

Normal View History Raw

[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved`
			`#`
			`# Licensed under the Apache License, Version 2.0 (the "License");`
			`# you may not use this file except in compliance with the License.`
			`# You may obtain a copy of the License at`
			`#`
			`# http://www.apache.org/licenses/LICENSE-2.0`
			`#`
			`# Unless required by applicable law or agreed to in writing, software`
			`# distributed under the License is distributed on an "AS IS" BASIS,`
			`# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.`
			`# See the License for the specific language governing permissions and`
			`# limitations under the License.`

			`from __future__ import annotations`

[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`import logging`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`import os`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`from enum import Enum`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`import numpy as np`

			`import paddle`
			`from paddle.base import core, dygraph`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`from paddle.base.executor import scope_guard`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`from paddle.base.framework import (`
			`Variable,`
			`)`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`from paddle.base.log_helper import get_logger`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`from paddle.jit.api import (`
			`_get_function_names_from_layer,`
			`get_ast_static_function,`
			`to_static,`
			`)`
			`from paddle.jit.dy2static.program_translator import (`
			`StaticFunction,`
			`)`
			`from paddle.nn import Layer`
			`from paddle.tensorrt.converter import PaddleToTensorRTConverter`
			`from paddle.tensorrt.util import (`
			`forbid_op_lower_trt,`
[CodeStyle][Typos][B-14,B-[17-19]] Fix typos(`Broardcast`,`Bradcast`,`Boardcast`,`buitin`,`buitlin`,`Buitin`,`builded`,`ba`) (#69966) * [CodeStyle][Typos][B-14,B-[17-19]] Fix typos * [CodeStyle][Typos][B-14,B-[17-19]] Fix typos(Broardcast,Bradcast,Boardcast,buitin,buitlin,Buitin,builded,ba) 2024-12-06 10:32:32 +08:00			`mark_builtin_op,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`run_pir_pass,`
[Inference]Refactor PIR-TRT weight code (#70762) * refactor trt weight logic * fix bugs * fix bugs * perfect * perfect constant folding * fix bugs * fix bus * fix bugs 2025-01-13 14:20:08 +08:00			`run_trt_partition,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`warmup_shape_infer,`
			`)`

【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`_logger = get_logger(`
			`__name__, logging.INFO, fmt='%(asctime)s-%(levelname)s: %(message)s'`
			`)`

[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`class Input:`
			`def __init__(`
			`self,`
[Inference]Modify TensorRT Input data name to collect shape in PIR (#71281) * Update export.py * fix codestyle * Update export.py * update 2025-02-27 19:28:28 +08:00			`warmup_data: tuple[np.ndarray, ...] \| None = None,`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`min_input_shape: tuple \| None = None,`
			`max_input_shape: tuple \| None = None,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`optim_input_shape: tuple \| None = None,`
			`input_data_type: str \| None = 'float32',`
			`input_range: tuple \| None = None,`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`name: str \| None = None,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`) -> None:`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`"""`
			`A class used to configure input data for models. This class serves two purposes:`

			`1. Random Data Generation: When no input data is supplied, it automatically generates random input data based on the specified minimum, optimal, and maximum shapes. In this mode,you can configure the data type (e.g., 'float32', 'int64', etc.) and the range of values (e.g.,(0.0, 1.0) for floats or (1, 10) for integers).`

			2. User-Provided Input: Alternatively, you can supply your own input data via the `warmup_data` argument. In this case, the provided data will be used directly, and the`input_data_type` and `input_range` settings will be ignored.

			`Args:`
			`warmup_data (tuple):`
			`The tuple of actual input data (for the automatic shape collection mechanism).`
			`min_input_shape (tuple):`
			`The shape of the minimum input tensor.`
			`max_input_shape (tuple):`
			`The shape of the maximum input tensor.`
			`optim_input_shape (tuple):`
			`The shape of the optimal input tensor.`
			`input_data_type (str, optional):`
			`The data type for the input tensors, such as 'float32' or 'int64' or 'float32' or 'int32' (default is float32).`
			`This option only applies when min_input_shape, optim_input_shape, and max_input_shape are provided; it does not apply to warmup_data.`
			`input_range (tuple, optional):`
			`The range of values used to generate input data. For floats, the default range is (0.0, 1.0). For integers, the default range is (1, 10).`
			`This option only applies when min_input_shape, optim_input_shape, and max_input_shape are provided; it does not apply to warmup_data.`
			`name:(str,optional):`
			`The name of the input to the model.`
			`Returns:`
			`None`

			`Examples:`
[CodeStyle][DocFormat][112] Use `pycon` marker in sparse/tensorrt docs (#77975) 2026-02-19 15:10:10 +08:00			`.. code-block:: pycon`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00
			`>>> # example 1:`
			`>>> from paddle.tensorrt.export import Input`
			`>>> input_config = Input(`
			`>>> min_input_shape=(1,100),`
			`>>> optim_input_shape=(4,100),`
			`>>> max_input_shape=(8,100),`
			`>>> )`
[CodeStyle][DocFormat][112] Use `pycon` marker in sparse/tensorrt docs (#77975) 2026-02-19 15:10:10 +08:00			`>>> input_config.input_data_type = 'int64'`
			`>>> input_config.input_range = (1, 10)`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00
			`>>> # example 2:`
			`>>> from paddle.tensorrt.export import Input`
			`>>> import numpy as np`
			`>>> input_config = Input(`
			`>>> warmup_data=(`
			`>>> np.random.rand(1,100).astype(np.float32),`
			`>>> np.random.rand(4,100).astype(np.float32),`
			`>>> np.random.rand(8,100).astype(np.float32),`
			`>>> )`
			`>>> )`
			`"""`
[Inference]Modify TensorRT Input data name to collect shape in PIR (#71281) * Update export.py * fix codestyle * Update export.py * update 2025-02-27 19:28:28 +08:00			`if warmup_data is not None:`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`if min_input_shape or max_input_shape or optim_input_shape:`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`raise ValueError(`
			`"warmup data provided; min/max/optim shapes are ignored."`
			`)`
			`if input_data_type is not None or input_range is not None:`
			`_logger.warning(`
			`"When warmup_data is provided,input_data_type and input_range are ignored."`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`"These parameters only apply when generate random data using min/opt/max shapes."`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`)`
			`else:`
			`if None in (min_input_shape, max_input_shape, optim_input_shape):`
			`raise ValueError(`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`"When warm_data is None, min/max/optim shapes must be specified."`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`)`

[Inference]Modify TensorRT Input data name to collect shape in PIR (#71281) * Update export.py * fix codestyle * Update export.py * update 2025-02-27 19:28:28 +08:00			`self.warmup_data = warmup_data`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`self.min_input_shape = min_input_shape`
			`self.max_input_shape = max_input_shape`
			`self.optim_input_shape = optim_input_shape`
			`self.input_data_type = input_data_type`
			`self.input_range = input_range`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`self.name = name`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`def generate_input_data(self):`
			`"""`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`Generates random input data based on the user-specified min_input_shape, optim_input_shape, and max_input_shape, as well as the data type and input range.`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`Returns:`
			`tuple(numpy.ndarray, numpy.ndarray, numpy.ndarray): A tuple containing the generated input data for the minimum, optimal, and maximum shapes.`

			`Examples:`
[CodeStyle] Enable docstring code format and start to use `pycon` as syntax highlight marker (#76542) 2025-11-24 15:45:41 +08:00			`.. code-block:: pycon`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00
[CodeStyle] Enable docstring code format and start to use `pycon` as syntax highlight marker (#76542) 2025-11-24 15:45:41 +08:00			`>>> from paddle.tensorrt.export import Input`
			`>>> input_config = Input(`
			`>>> min_input_shape=(1,100),`
			`>>> optim_input_shape=(4,100),`
			`>>> max_input_shape=(8,100),`
			`>>> )`
			`>>> input.input_data_type = 'int64'`
			`>>> input.input_range = (1, 10)`
[CodeStyle][Xdoctest][8,12,15,16,18-20,22-26,29-32,34-36,38-40,42,43,45-48,50,51,60-65,75,80,82,83,85-87,89-94,99-141,143,145,147-167,169-187,207-220,257,258,260-275,277-313,315-325][API Compatibility] Update shape output format in documentation examples (#76574) --------- Co-authored-by: SigureMo <sigure.qaq@gmail.com> 2025-11-26 10:31:46 +08:00			`>>> input_min_data, input_optim_data, input_max_data = input_config.generate_input_data()`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`"""`
[Inference]Modify TensorRT Input data name to collect shape in PIR (#71281) * Update export.py * fix codestyle * Update export.py * update 2025-02-27 19:28:28 +08:00			`if self.warmup_data is not None:`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`raise RuntimeError(`
			`"generate_input_data() should not be called when warmup_data is provided."`
			`)`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`if self.input_range is None:`
			`self.input_range = (`
			`(0.0, 1.0) if 'float' in self.input_data_type else (1, 10)`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`)`
【Paddle Tensor】Fix bugs related to converting unit tests of the old ir-trt into pir-trt (#71083) * fix share_data * Update test_trt_convert_share_data.py * update auto_scan * fix set_value * fix codestyle * Update test_trt_convert_set_value.py * Update test_trt_convert_clip.py * fix grid_sampler * fix instance_norm * fix index_select * fix affine_channel * fix anchor_generator * fix where * fix * fix codestyle * fix group_norm * fix index_put * modify preln_residual_bias * fix range * Update test_trt_convert_linear_interp_v2.py * fix codestyle * fix * Update test_trt_convert_range.py * fix embedding * fix prelu * fix codestyle * fix codestyle * fix leaky_relu * fix shuffle_channel * fix yolo_box * fix codestyle * fix take_along_axis * fix gather * fix gather_nd * fix codestyle * fix einsum * Update export.py * fix codestyle * Revert "fix codestyle" This reverts commit 111955690db816a9e37c4e6c45c12dfc5475c39a. * fix codestyle * fix * fix * add * fix affine_channel * fix index_select * fix gather * fix gather * fix index_select * fix last time * really last time * Update test_trt_convert_gather.py * Update test_trt_convert_gather.py * update timeout * update timeout * Update CMakeLists.txt * fix last time * Update CMakeLists.txt * last time 2025-03-19 15:39:02 +08:00			`low, high = self.input_range`

			`if low == high:`
			`self.input_min_data = np.full(`
			`self.min_input_shape, low, dtype=self.input_data_type`
			`)`
			`self.input_optim_data = np.full(`
			`self.optim_input_shape, low, dtype=self.input_data_type`
			`)`
			`self.input_max_data = np.full(`
			`self.max_input_shape, low, dtype=self.input_data_type`
			`)`
			`return (`
			`self.input_min_data,`
			`self.input_optim_data,`
			`self.input_max_data,`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`if 'int' in self.input_data_type:`
			`self.input_min_data = np.random.randint(`
			`low, high, size=self.min_input_shape`
			`).astype(self.input_data_type)`
			`self.input_optim_data = np.random.randint(`
			`low, high, size=self.optim_input_shape`
			`).astype(self.input_data_type)`
			`self.input_max_data = np.random.randint(`
			`low, high, size=self.max_input_shape`
			`).astype(self.input_data_type)`
			`else:`
			`self.input_min_data = np.random.uniform(`
			`low, high, size=self.min_input_shape`
			`).astype(self.input_data_type)`
			`self.input_optim_data = np.random.uniform(`
			`low, high, size=self.optim_input_shape`
			`).astype(self.input_data_type)`
			`self.input_max_data = np.random.uniform(`
			`low, high, size=self.max_input_shape`
			`).astype(self.input_data_type)`

			`return (`
			`self.input_min_data,`
			`self.input_optim_data,`
			`self.input_max_data,`
			`)`

[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`class PrecisionMode(Enum):`
			`FP32 = "FP32"`
			`FP16 = "FP16"`
			`BF16 = "BF16"`
			`INT8 = "INT8"`

			`"""`
			`This class defines different precision modes that can be used to configure`
			`TensorRT optimization. The modes include FP32, FP16, BF16, and INT8.`
			`Specifies the precision mode for TensorRT optimization. The options are:`
			`- PrecisionMode.FP32: 32-bit floating point precision (default).`
			`- PrecisionMode.FP16: 16-bit floating point precision.`
			`- PrecisionMode.INT8: 8-bit integer precision.`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`- PrecisionMode.BF16: 16-bit Brain Floating Point precision. Only supported in TensorRT versions greater than 9.0.`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`"""`


[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`class TensorRTConfig:`
			`def __init__(`
			`self,`
			`inputs: list,`
			`min_subgraph_size: int \| None = 3,`
			`save_model_dir: str \| None = None,`
			`disable_ops: str \| list \| None = None,`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`precision_mode: PrecisionMode = PrecisionMode.FP32,`
[Paddle TensorRT] Fix PaddleX model bugs when convert to pir-trt (#69957) * fix * fix pd_op.squeeze+pd_op.flatten * fix * fix * fix * fix * fix * 添加pool2d全图进trt * fix 2024-12-09 19:52:39 +08:00			`ops_run_float: str \| list \| None = None,`
[Paddle TensorRT] add pd_op.logical_not converter (#70267) * pd_op.logical_not * fix * fix * fix * fix * fix * fix * fix * fix * fix * add optimization_level 2024-12-19 21:22:36 +08:00			`optimization_level: int \| None = 3,`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`disable_passes: list = [],`
[Paddle TensorRT]add `workspace_size` to TensorRTConfig (#70778) * update workspace_size * fix codestyle 2025-01-23 16:12:52 +08:00			`workspace_size: int \| None = 1 << 30,`
[Inference]Support trt cuda graph in PIR (#71982) * support use graph * Update test_converter_model_resnet50.py * fix codestyle * fix codestyle * delete warmup * warm up * update * Update tensorrt_engine_instruction.cc * fix codestyle * Update test_converter_model_resnet50.py * Update pybind.cc 2025-04-09 17:35:09 +08:00			`use_cuda_graph: bool \| None = False,`
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`refit_params_path: str \| None = None,`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`disable_logging: bool \| None = True,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`) -> None:`
			`"""`
			`A class for configuring TensorRT optimizations.`

			`Args:`
			`inputs (list):`
			`A list of Input configurations`
			`min_subgraph_size (int, optional):`
			`The minimum number of operations in a subgraph for TensorRT to optimize (default is 3).`
			`save_model_dir (str, optional):`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`The directory where the optimized model will be saved (default is not to save).`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`disable_ops : (str\|list, optional):`
			`A string representing the names of operations that should not be entering by TensorRT (default is None).`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`precision_mode (PrecisionMode, optional):`
			`Specifies the precision mode for TensorRT optimization. The options are:`
			`- PrecisionMode.FP32: 32-bit floating point precision (default).`
			`- PrecisionMode.FP16: 16-bit floating point precision.`
			`- PrecisionMode.INT8: 8-bit integer precision.`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`- PrecisionMode.BF16: 16-bit Brain Floating Point precision. Only supported in TensorRT versions greater than 9.0.`
[Paddle TensorRT] Fix PaddleX model bugs when convert to pir-trt (#69957) * fix * fix pd_op.squeeze+pd_op.flatten * fix * fix * fix * fix * fix * 添加pool2d全图进trt * fix 2024-12-09 19:52:39 +08:00			`ops_run_float (str\|list, optional):`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			A set of operation names that should be executed using FP32 precision regardless of the `tensorrt_precision_mode` setting.
[Paddle TensorRT] add pd_op.logical_not converter (#70267) * pd_op.logical_not * fix * fix * fix * fix * fix * fix * fix * fix * fix * add optimization_level 2024-12-19 21:22:36 +08:00			`optimization_level (int, optional):`
			`Set TensorRT optimization level (default is 3). Only supported in TensorRT versions greater than 8.6.`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`disable_passes : (str\|list, optional):`
			`A list of string representing the names of pass that should not be used for origin program (default is []).`
[Paddle TensorRT]add `workspace_size` to TensorRTConfig (#70778) * update workspace_size * fix codestyle 2025-01-23 16:12:52 +08:00			`workspace_size (int, optional):`
			`Specifies the maximum GPU memory (in bytes) that TensorRT can use for the optimization process (default is 1 << 30).`
[Inference]Support trt cuda graph in PIR (#71982) * support use graph * Update test_converter_model_resnet50.py * fix codestyle * fix codestyle * delete warmup * warm up * update * Update tensorrt_engine_instruction.cc * fix codestyle * Update test_converter_model_resnet50.py * Update pybind.cc 2025-04-09 17:35:09 +08:00			`use_cuda_graph (bool, optional):`
			`Specify whether TensorRT enables cuda_graph during the optimization process (default is false).`
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`refit_params_path(str, optional):`
			`The path to the weights that need to be refitted.`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`disable_logging (bool, optional):`
[Inference]Support control trt_glog_info in PIR (#72429) * add_debug_trt * Update paddle_analysis_config.h * fix codestyle * fix codestyle * Update tensorrt_test_base.py * fix * Update paddle_analysis_config.h * Update paddle_analysis_config.h * update * fix codestyle * update * fix codestyle 2025-05-07 12:30:34 +08:00			`Specifies whether to enable GLOG info output during the optimization process (default is true).`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`Returns:`
			`None`

			`Examples:`
[CodeStyle] Enable docstring code format and start to use `pycon` as syntax highlight marker (#76542) 2025-11-24 15:45:41 +08:00			`.. code-block:: pycon`

			`>>> # example 1:`
			`>>> from paddle.tensorrt.export import (`
			`>>> Input,`
			`>>> TensorRTConfig,`
			`>>> PrecisionMode,`
			`>>> )`
			`>>> input_config = Input(`
			`>>> min_input_shape=(1,100),`
			`>>> optim_input_shape=(4,100),`
			`>>> max_input_shape=(8,100),`
			`>>> )`
			`>>> input_config.input_data_type = 'int64'`
			`>>> input_config.input_range = (1, 10)`

			`>>> trt_config = TensorRTConfig(inputs=[input_config])`
			`>>> trt_config.disable_ops = ["pd_op.dropout"]`
			`>>> trt_config.precision_mode = PrecisionMode.FP16`
			`>>> trt_config.ops_run_float = "pd_op.conv2d"`
			`>>> trt_config.workspace_size = 1 << 32`

			`>>> # example 2:`
			`>>> from paddle.tensorrt.export import (`
			`>>> Input,`
			`>>> TensorRTConfig,`
			`>>> PrecisionMode,`
			`>>> )`
			`>>> input_config = Input(`
			`>>> warmup_data=(`
			`>>> np.random.rand(1,100).astype(np.float32),`
			`>>> np.random.rand(4,100).astype(np.float32),`
			`>>> np.random.rand(8,100).astype(np.float32),`
			`>>> )`
			`>>> )`
			`>>> trt_config = TensorRTConfig(inputs=[input_config])`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`"""`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`# Checking Input Consistency`
[Inference]Modify TensorRT Input data name to collect shape in PIR (#71281) * Update export.py * fix codestyle * Update export.py * update 2025-02-27 19:28:28 +08:00			`has_input_data = [i.warmup_data is not None for i in inputs]`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`if any(has_input_data):`
			`if not all(has_input_data):`
			`raise ValueError("All Inputs must have input_data if any does.")`

[Paddle TensorRT] add pd_op.shape converter (#68395) * shape converter * 适配多输入 * ci bug * 命名 * fix 2024-09-26 16:06:54 +08:00			`self.inputs = inputs`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`self.min_subgraph_size = min_subgraph_size`
			`self.save_model_dir = save_model_dir`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`self.precision_mode = precision_mode`
[Paddle TensorRT] Fix PaddleX model bugs when convert to pir-trt (#69957) * fix * fix pd_op.squeeze+pd_op.flatten * fix * fix * fix * fix * fix * 添加pool2d全图进trt * fix 2024-12-09 19:52:39 +08:00			`self.ops_run_float = ops_run_float`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`self.disable_ops = disable_ops`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`self.disable_passes = disable_passes`
[Paddle TensorRT] add pd_op.logical_not converter (#70267) * pd_op.logical_not * fix * fix * fix * fix * fix * fix * fix * fix * fix * add optimization_level 2024-12-19 21:22:36 +08:00			`self.optimization_level = optimization_level`
[Paddle TensorRT]add `workspace_size` to TensorRTConfig (#70778) * update workspace_size * fix codestyle 2025-01-23 16:12:52 +08:00			`self.workspace_size = workspace_size`
[Inference]Support trt cuda graph in PIR (#71982) * support use graph * Update test_converter_model_resnet50.py * fix codestyle * fix codestyle * delete warmup * warm up * update * Update tensorrt_engine_instruction.cc * fix codestyle * Update test_converter_model_resnet50.py * Update pybind.cc 2025-04-09 17:35:09 +08:00			`self.use_cuda_graph = use_cuda_graph`
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`self.refit_params_path = refit_params_path`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`self.disable_logging = disable_logging`
【TensorRT】Fix the situation where the filter in the convert_conv function is the output of another operator (#72302) * fix bugs * fix conv3d * fix codestyle * fix * fix codestyle * Update converter_utils.py * Update converter_utils.py * Update converter_utils.py * fix codestyle * Update converter_utils.py * fix 2025-04-18 12:23:11 +08:00			`if self.refit_params_path:`
			`self.disable_passes.append("constant_folding_pass")`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`paddle.framework.set_flags(`
			`{'FLAGS_trt_min_group_size': min_subgraph_size}`
			`)`


			`# return an optimized program with pd_op.tensorrt_engine operations.`
			`def convert_to_trt(program, trt_config, scope):`
			`if not isinstance(program, paddle.base.libpaddle.pir.Program):`
			`raise TypeError(`
			`f"program type must be paddle.base.libpaddle.pir.Program, but received {type(program)}"`
			`)`

			`feed_name = []`
			`for op in program.global_block().ops:`
			`if op.name() == "pd_op.data" or op.name() == "pd_op.feed":`
			`param_name = op.attrs()["name"]`
			`feed_name.append(param_name)`

			`with paddle.pir_utils.IrGuard():`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`feeds = []`
			`if trt_config.inputs[0].warmup_data is not None:`
			`input_tuples = [inp.warmup_data for inp in trt_config.inputs]`
			`# Check all inputs have the same number of warmup_data samples`
			`assert len({len(t) for t in input_tuples}) == 1`
			`num_samples = len(input_tuples[0])`
			`for sample_idx in range(num_samples):`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`feed_dict = {}`
			`for i, inp in enumerate(trt_config.inputs):`
			`name = inp.name if inp.name is not None else feed_name[i]`
			`feed_dict[name] = input_tuples[i][sample_idx]`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`feeds.append(feed_dict)`
			`else:`
			`input_tuples = [i.generate_input_data() for i in trt_config.inputs]`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`for i in range(len(input_tuples[0])):`
			`feed_dict = {}`
			`for j, inp in enumerate(trt_config.inputs):`
			`name = inp.name if inp.name is not None else feed_name[j]`
			`feed_dict[name] = input_tuples[j][i]`
			`feeds.append(feed_dict)`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`# run pir pass (including trt_op_marker_pass)`
			`program_with_pir = run_pir_pass(`
[Paddle TensorRT] add pd_op.shape converter (#68395) * shape converter * 适配多输入 * ci bug * 命名 * fix 2024-09-26 16:06:54 +08:00			`program,`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`disable_passes=trt_config.disable_passes,`
			`scope=scope,`
[Inference]Support INT8 quant in PIR-TRT (#71127) * support int8 quant in trt * support int8 quant in trt * fix coverage * perfect code 2025-02-17 16:03:53 +08:00			`precision_mode=trt_config.precision_mode,`
[Inference]Support some pass use in converter (#70529) * support pass in converter * fix bugs * fix unittest * resolve conflict * resolve unittest * fix unittest * fix unittest * reduce file * perfect comment 2025-01-09 16:25:01 +08:00			`)`

			`# run warmup for collecting shape`
			`program = warmup_shape_infer(`
			`program_with_pir,`
[Inference]Support TensorRT Input data to collect shape in PIR (#71235) * add_collect_shape * fix codestyle * fix * delete enable_collect_shape * Update export.py * Update export.py * Update export.py * Update export.py * fix codestyle * fix codestyle last time 2025-02-25 18:30:26 +08:00			`feeds=feeds,`
[Paddle TensorRT] add pd_op.shape converter (#68395) * shape converter * 适配多输入 * ci bug * 命名 * fix 2024-09-26 16:06:54 +08:00			`scope=scope,`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
Fix Extra VRAM Usage in PPTRT (#76296) 2025-11-13 11:22:24 +08:00			`paddle.device.empty_cache()`

[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`# specify certain operators to be excluded from entering TensorRT`
			`if trt_config.disable_ops:`
			`forbid_op_lower_trt(program, trt_config.disable_ops)`

[Inference]Fix PIR-TRT bugs (#68776) * fix model bug * re-trigger * fix matmul bug * fix bugs 2024-11-05 20:30:09 +08:00			`# Adding marker labels to builtin ops facilitates convert processing, but they ultimately do not enter the TensorRT subgraph.`
[CodeStyle][Typos][B-14,B-[17-19]] Fix typos(`Broardcast`,`Bradcast`,`Boardcast`,`buitin`,`buitlin`,`Buitin`,`builded`,`ba`) (#69966) * [CodeStyle][Typos][B-14,B-[17-19]] Fix typos * [CodeStyle][Typos][B-14,B-[17-19]] Fix typos(Broardcast,Bradcast,Boardcast,buitin,buitlin,Buitin,builded,ba) 2024-12-06 10:32:32 +08:00			`mark_builtin_op(program)`
[Inference]Fix PIR-TRT bugs (#68776) * fix model bug * re-trigger * fix matmul bug * fix bugs 2024-11-05 20:30:09 +08:00
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`# run pir pass (including trt_sub_graph_extract_pass)`
[Inference]Refactor PIR-TRT weight code (#70762) * refactor trt weight logic * fix bugs * fix bugs * perfect * perfect constant folding * fix bugs * fix bus * fix bugs 2025-01-13 14:20:08 +08:00			`program_with_pir = run_trt_partition(program)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`# Step4: run TRTConverter (would lower group_op into tensorrt_engine_op)`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`converter = PaddleToTensorRTConverter(`
			`program_with_pir, scope, trt_config=trt_config`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`converter.convert_program_to_trt()`
			`trt_output_var = []`

			`for op in program_with_pir.global_block().ops:`
			`if op.name() == "pd_op.fetch":`
			`for operand in op.operands():`
			`source = operand.source()`
			`trt_output_var.append(source)`

			`# Save PIR program as JSON`
			`if trt_config.save_model_dir:`
			`input_values = []`
			`input_values.extend(`
			`result`
			`for op in program_with_pir.global_block().ops`
			`if op.name() == "pd_op.data" or op.name() == "pd_op.feed"`
			`for result in op.results()`
			`)`
			`place = paddle.CUDAPlace(0)`
			`exe = paddle.static.Executor(place)`

[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`with scope_guard(scope):`
			`paddle.static.save_inference_model(`
			`trt_config.save_model_dir,`
			`input_values,`
			`trt_output_var,`
			`exe,`
			`program=program_with_pir,`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`return program_with_pir`


			`# Obtain a program with tensorrt_op for dynamic-to-static scenarios.`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`def _convert_(function=None, input_spec=None, config=None, **kwargs):`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`"""`
			`Convert a dynamic graph API to a static graph and apply TensorRT optimizations if relevant parameters are configured.`

			`Args:`
			`function (callable): Callable dynamic graph function. If it used as a`
			`decorator, the decorated function will be parsed as this parameter.`
			`input_spec (list[InputSpec]\|tuple[InputSpec]): list/tuple of InputSpec to`
			`specific the shape/dtype/name information of each input Tensor.`
			`config: (TensorRTConfig): The configuration of TensorRTConfig.`
			kwargs: Support keys including `property`, set `property` to True if the function
			`is python property.`

			`Returns:`
			`tuple: A tuple containing two elements. The first element is the TensorRT optimized program., optionally optimized with TensorRT if configured. The second element is the scope containing the parameters.`

			`"""`
			`# Converts dynamic graph APIs into static graph`
			`static_net = paddle.jit.to_static(`
			`function,`
			`input_spec=input_spec,`
			`**kwargs,`
			`)`
			`is_prim_infer = core._is_fwd_prim_enabled() and core._is_bwd_prim_enabled()`
			`# If the input layer be wrapped by DataParallel,`
			`# the args and kwargs of forward method will can't be parsed by`
			`# function_spec, so here we save DataParallel._layers instead`
			`# DataParallel it self`
			`# using inner_layer, do not change input layer`
			`if isinstance(static_net, paddle.DataParallel):`
			`inner_layer = static_net._layers`
			`else:`
			`inner_layer = static_net`

			`# avoid change user given input_spec`
			`inner_input_spec = None`
			`if input_spec is not None:`
			`if isinstance(static_net, Layer):`
			`for member_name in _get_function_names_from_layer(inner_layer):`
			`static_func = getattr(inner_layer, member_name, None)`
			`if (`
			`isinstance(static_func, StaticFunction)`
			`and 'forward' != member_name`
			`):`
			`raise ValueError(`
			`f"If there are static functions other than 'forward' that need to be saved, the input 'input_spec' should be None, but received the type of 'input_spec' is {type(input_spec)}."`
			`)`
			`if not isinstance(input_spec, (list, tuple)):`
			`raise TypeError(`
			`f"The input input_spec should be 'list', but received input_spec's type is {type(input_spec)}."`
			`)`
			`inner_input_spec = []`
			`for var in paddle.utils.flatten(input_spec):`
			`if isinstance(var, paddle.static.InputSpec):`
			`inner_input_spec.append(var)`
			`elif isinstance(`
			`var, (core.eager.Tensor, Variable, paddle.pir.Value)`
			`):`
			`inner_input_spec.append(`
			`paddle.static.InputSpec.from_tensor(var)`
			`)`
			`else:`
			# Support non-Tensor type in `input_spec`
			`inner_input_spec.append(var)`

			`# whether outermost layer has pre/post hook, if does, we need also save`
			`# these operators in program.`
			`with_hook = False`
			`scope = core.Scope()`
			`extra_var_info = {}`
			`if isinstance(static_net, Layer):`
			`functions = list(set(_get_function_names_from_layer(static_net)))`
			`functions = sorted(functions)`
			`if static_net._forward_pre_hooks or static_net._forward_post_hooks:`
			`with_hook = True`
			`else:`
			`# layer is function`
			`functions = [static_net]`

			`property_vals = [] # (value, key)`
			`concrete_program = None`
			`for attr_func in functions:`
			`if isinstance(static_net, Layer):`
			`static_func = get_ast_static_function(`
			`getattr(inner_layer, attr_func, None)`
			`)`
			`if isinstance(static_func, StaticFunction):`
			`if static_func.is_property:`
			`# property method to be exported`
			`immediate_val = static_func()`
			`property_vals.append(`
			`(`
			`immediate_val,`
			`static_net.__class__.__name__ + '.' + attr_func,`
			`)`
			`)`
			`continue`
			`concrete_program = (`
			`static_func.concrete_program_specify_input_spec(`
			`inner_input_spec,`
			`with_hook=with_hook,`
			`is_prim_infer=is_prim_infer,`
			`)`
			`)`
			`elif 'forward' == attr_func:`
			`# if input_spec is incomplete, declarative will throw error`
			`# inner_input_spec is list[InputSpec], it should be packed with same structure`
			`# as original input_spec here`
			`if inner_input_spec:`
			`inner_input_spec = paddle.utils.pack_sequence_as(`
			`input_spec, inner_input_spec`
			`)`
			`static_forward = to_static(`
			`inner_layer.forward,`
			`input_spec=inner_input_spec,`
			`full_graph=True,`
			`)`
			`concrete_program = (`
			`static_forward.concrete_program_specify_input_spec(`
			`with_hook=with_hook, is_prim_infer=is_prim_infer`
			`)`
			`)`
			`inner_input_spec = None`
			`else:`
			`continue`
			`else:`
			`# When layer is a function`
			`if isinstance(attr_func, StaticFunction):`
			`static_func = get_ast_static_function(attr_func)`
			`if static_func.is_property:`
			`immediate_val = static_func()`
			`property_vals.append((immediate_val, static_func))`
			`continue`

			`concrete_program = (`
			`static_func.concrete_program_specify_input_spec(`
			`inner_input_spec, is_prim_infer=is_prim_infer`
			`)`
			`)`
			`else:`
			`static_func = get_ast_static_function(attr_func)`
			`if inner_input_spec:`
			`inner_input_spec = paddle.utils.pack_sequence_as(`
			`input_spec, inner_input_spec`
			`)`
			`static_function = to_static(`
			`static_func,`
			`input_spec=inner_input_spec,`
			`full_graph=True,`
			`)`
			`concrete_program = static_function.concrete_program`

			# when save multi `StaticFunction`, all `StaticFunction` share params.
			`dygraph_state_dict = None`
			`if isinstance(inner_layer, Layer):`
			`dygraph_state_dict = inner_layer.to_static_state_dict()`
			`elif isinstance(attr_func, StaticFunction):`
[Dy2St] Use weakref for layer instance conversion to avoid circular reference (#68850) 2024-10-23 23:47:18 +08:00			`if static_func.class_instance:`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`dygraph_state_dict = (`
[Dy2St] Use weakref for layer instance conversion to avoid circular reference (#68850) 2024-10-23 23:47:18 +08:00			`static_func.class_instance.to_static_state_dict()`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`)`
			`if dygraph_state_dict:`
			`# we maintain the mapping of variable name to`
			`# structured name, the buffer variable (non-persistable)`
			`# saved to inference program may not need by dygraph Layer,`
			`# we only record the state_dict variable's structured name`
			`state_names_dict = {}`
			`state_var_dict = {}`
fix typo disable_loggling -> disable_logging (#75978) * fix typo disable_loggling -> disable_logging * fix * fix 2025-10-22 15:01:25 +08:00			`for structured_name, var in dygraph_state_dict.items():`
			`state_names_dict[var.name] = structured_name`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`state_var_dict[var.name] = var`
			`# share parameters from Layer to scope & record var info`
			`with dygraph.guard():`
			`for tensor, value in zip(*concrete_program.parameters):`
			`if not value.persistable:`
			`continue`
			`param_or_buffer_tensor = scope.var(value.name).get_tensor()`

			`src_tensor = state_var_dict[tensor.name].value().get_tensor()`
			`param_or_buffer_tensor._share_data_with(src_tensor)`
			`with paddle.pir_utils.IrGuard():`
			`main_program = concrete_program.main_program`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`output_vars = concrete_program.outputs`
			`paddle.base.executor._add_pir_fetch_ops(`
			`program=main_program, fetch_list=output_vars, fetch_var_name="fetch"`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`program_with_trt = convert_to_trt(main_program, config, scope)`
			`return program_with_trt, scope`


			`# Obtain a program with tensorrt_op by directly loading the model.`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`def convert(model_path, config):`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`"""`
			`Loading a PaddlePaddle Model and Exporting the TensorRT-Optimized Program.`

			`Args:`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`model_path(str):The directory path where the PaddlePaddle model is located.`
			`The model path can either include the model directory and prefix (e.g., 'model_dir/inference'),`
			`or it can be the full path to the model (e.g., 'model_dir/inference.json').`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`config(TensorRTConfig):The configuration of TensorRTConfig.`

			`Returns:`
			`program:The TensorRT optimized program.`

			`Examples:`
[CodeStyle][DocFormat][112] Use `pycon` marker in sparse/tensorrt docs (#77975) 2026-02-19 15:10:10 +08:00			`.. code-block:: pycon`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00
			`>>> # example 1:`
			`>>> # This example takes the user-specified model input shape, and Paddle internally generates corresponding random data.`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`>>> import numpy as np`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`>>> import paddle`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`>>> import paddle.inference as paddle_infer`
			`>>> import paddle.nn.functional as F`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`>>> from paddle import nn`
			`>>> from paddle.tensorrt.export import Input, TensorRTConfig`

			`>>> class LinearNet(nn.Layer):`
			`>>> def __init__(self, input_dim):`
			`>>> super().__init__()`
			`>>> self.linear = nn.Linear(input_dim, input_dim)`

			`>>> def forward(self, x):`
			`>>> return F.relu(self.linear(x))`

			`>>> input_dim = 3`
			`>>> # 1.Instantiate the network.`
			`>>> layer = LinearNet(input_dim)`

			`>>> save_path = "/tmp/linear_net"`
			`>>> # 2.Convert dynamic graph to static graph and save as a JSON file.`
			`>>> paddle.jit.save(layer, save_path, [paddle.static.InputSpec(shape=[-1, input_dim])])`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`>>> # 3.Create TensorRTConfig`
			`>>> input_config = Input(`
			`>>> min_input_shape=[1, input_dim],`
			`>>> optim_input_shape=[2, input_dim],`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`>>> max_input_shape=[4, input_dim],`
			`>>> name='x',`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`>>> )`

			`>>> trt_config = TensorRTConfig(inputs=[input_config])`
			`>>> trt_config.save_model_dir = "/tmp/linear_net_trt"`

			`>>> # 4.Perform TensorRT conversion`
			`>>> program_with_trt = paddle.tensorrt.convert(save_path, trt_config)`

			`>>> # 5.Create a Predictor and run TensorRT inference.`
			`>>> config = paddle_infer.Config(`
			`>>> trt_config.save_model_dir + '.json',`
			`>>> trt_config.save_model_dir + '.pdiparams',`
			`>>> )`
			`>>> config.enable_use_gpu(100, 0)`
			`>>> predictor = paddle_infer.create_predictor(config)`

			`>>> input_data = np.random.randn(2, 3).astype(np.float32)`
			`>>> model_input = paddle.to_tensor(input_data)`

			`>>> output_converted = predictor.run([model_input])`

			`>>> # example 2:`
			`>>> # In this example, the user specifies the actual input.`
			`>>> import numpy as np`
			`>>> import paddle`
			`>>> import paddle.inference as paddle_infer`
			`>>> import paddle.nn.functional as F`
			`>>> from paddle import nn`
			`>>> from paddle.tensorrt.export import Input, TensorRTConfig`

			`>>> class LinearNet(nn.Layer):`
			`>>> def __init__(self, input_dim):`
			`>>> super().__init__()`
			`>>> self.linear = nn.Linear(input_dim, input_dim)`

			`>>> def forward(self, x):`
			`>>> return F.relu(self.linear(x))`

			`>>> input_dim = 3`
			`>>> # 1.Instantiate the network.`
			`>>> layer = LinearNet(input_dim)`

			`>>> save_path = "/tmp/linear_net"`
			`>>> # 2.Convert dynamic graph to static graph and save as a JSON file.`
			`>>> paddle.jit.save(layer, save_path, [paddle.static.InputSpec(shape=[-1, input_dim])])`

			`>>> # 3.Create TensorRTConfig`
			`>>> input_config = Input(`
			`>>> warmup_data=(`
			`>>> np.random.rand(1,3).astype(np.float32),`
			`>>> np.random.rand(2,3).astype(np.float32),`
			`>>> np.random.rand(4,3).astype(np.float32),`
【Paddle TensorRT】Modified the serialization save path for TensorRT and added an attribute name to the Input class (#71722) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * PIR-TRT is not supposed to run on Windows, but it is running. * not windows * not windows * 单测问题，跑旧ir-trt * 单测问题，跑旧ir-trt 2025-03-19 19:34:32 +08:00			`>>> ),`
			`>>> name='x',`
【Paddle TensorRT】fix document (#71470) * pd_op.linear_interp * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix 2025-03-11 13:08:48 +08:00			`>>> )`

			`>>> trt_config = TensorRTConfig(inputs=[input_config])`
			`>>> trt_config.save_model_dir = "/tmp/linear_net_trt"`

			`>>> # 4.Perform TensorRT conversion`
			`>>> program_with_trt = paddle.tensorrt.convert(save_path, trt_config)`

			`>>> # 5.Create a Predictor and run TensorRT inference.`
			`>>> config = paddle_infer.Config(`
			`>>> trt_config.save_model_dir + '.json',`
			`>>> trt_config.save_model_dir + '.pdiparams',`
			`>>> )`
			`>>> config.enable_use_gpu(100, 0)`
			`>>> predictor = paddle_infer.create_predictor(config)`

			`>>> input_data = np.random.randn(2, 3).astype(np.float32)`
			`>>> model_input = paddle.to_tensor(input_data)`

			`>>> output_converted = predictor.run([model_input])`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
			`"""`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`if os.path.abspath(config.save_model_dir) == os.path.abspath(model_path):`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`raise ValueError(`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			"The `config.save_model_dir` and `model_path` cannot be the same. Please specify a different directory for saving the model."
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`)`

			`scope = paddle.static.global_scope()`
			`place = paddle.CUDAPlace(0)`
			`exe = paddle.static.Executor(place)`

			`is_json = True`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00
			`if os.path.isfile(model_path):`
			`model_path = model_path`
			`model_dir, model_file = os.path.split(model_path)`
			`model_prefix, ext = os.path.splitext(model_file)`
			`if ext == '.json':`
			`is_json = True`
			`elif ext == '.pdmodel':`
			`is_json = False`
			`else:`
			`raise ValueError(`
			`f"Unsupported extension {ext}. Only support json/pdmodel"`
			`)`
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`params_path = os.path.join(model_dir, model_prefix + '.pdiparams')`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`else:`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`model_prefix = model_path`
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`params_path = model_prefix + '.pdiparams'`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`if os.path.exists(model_prefix + '.json'):`
			`is_json = True`
			`elif os.path.exists(model_prefix + '.pdmodel'):`
			`is_json = False`
			`else:`
			`raise ValueError(`
			`f"No valid model file found in the directory '{model_path}'. Expected either 'json' or 'pdmodel'. Please ensure that the directory contains one of these files."`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00
【Paddle TensorRT】Pir-trt support TensorRT Refittable (#71501) * pir-refittable * 修改pd_op.layer_norm converter * fix * fix * fix * fix * fix * merge * Revert "fix" This reverts commit 5a4df6e4c7f4b9bd8aa6bf1ffcae22f076d20d00. * merge * fix * 功能完成 * resnet50能够refit * fix * fix * fix * fix * fix * fix * review修改完毕 * Add a new class called RefitManager that is solely used for managing content related to refitting. * add forbid_cast_op * codestyple-check * Add a convert function to ensure that the model name and weight name provided by the user are consistent, and fix the issue of duplicate naming between pd_op.batch_norm and weight_to_tensor. * fix the issue of duplicate naming * Only refit those with inputs from builtin.parameters * Fix the compilation and linking errors in test_tensorrt_engine_instruction.cc. * Remove unnecessary code from util.py * fix pd_op.fused_bias_dropout_residual_layer_norm * delete ceshi.md * deleted the extra comments * fix pd_op.batch_norm * fix * fix * fix * Added an extra unit test by mistake, please delete it. * fix pd_op.fused_conv2d_add_act * fix pd_op.fused_conv2d_add_act * Add the bias to pd_op.fused_conv2d_add_act. * The input for the bias might also be a builtin.constant * fix pd_op.fused_conv2d_add_act * fix bug * fix TestInstanceNormWith3DInputTRTPattern 2025-04-08 10:48:16 +08:00			`if not os.path.exists(params_path):`
			`raise ValueError(`
			`f"Parameters file '{params_path}' not found. Please ensure the weights file exists in the model directory."`
			`)`

[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`if is_json:`
			`with paddle.pir_utils.IrGuard():`
			`[program, feed_target_names, fetch_targets] = (`
			`paddle.static.io.load_inference_model(`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`model_path,`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`executor=exe,`
			`)`
			`)`
			`else:`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`with paddle.pir_utils.OldIrGuard():`
			`os.environ['FLAGS_enable_pir_in_executor'] = '1'`
			`[program, feed_target_names, fetch_targets] = (`
			`paddle.static.io.load_inference_model(`
			`model_path,`
			`executor=exe,`
			`)`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`)`
[Paddle TensorRT] Paddle-TensorRT support fp16 and dynamic graph APIs into static graph support predictor (#69597) * fp16 * support fp16 * fix * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * fox * fix * fix * fix * fix * fix * 暴露PrecisionMode * fix * fix * fix 2024-12-05 14:29:55 +08:00			`os.environ['FLAGS_enable_pir_in_executor'] = '0'`
[Paddle TensorRT] Support inference using predictor.run (#67755) * predictor.run跑通 * TensorRTConfig设计,跑通test_converter_model_bert.py * 可以适配多个输入的场景 * trt_op_marker跑两遍 * 删除基础pass,并修改bug * trt接口 * 消除core.py * ci超时 * fix冲突 * 暂存代码 * pir用户接口设计 * fix 注释 * fix 注释 * fix * delete ununsed code * 解决了动转静scope找不到var的问题 * test_converter_model_cumsum.py -> test_converter_export.py * 增加了动转静单测 * 删除get_program函数 * 消除冲突+export接口测试跑通 * 提交用户接口文档 * 修改接口命名和文档 * 删除中文注释 * test_export改名test_convert * 代码检查 * 修改input_data的描述 * 修改测试名称以及修改单侧名称 * 修复test_converter中没适配warm_up_shape导致的问题相对路径下报错，并修改只能load json的问题 * fix * 修复pd_op.fetch获取的位置 * 修改注释 * 冲突 * 修改 2024-09-14 13:47:40 +08:00			`return convert_to_trt(program, config, scope)`