# PaddlePaddle / Paddle
# PArallel Distributed Deep LEarning: a machine-learning framework from
# industrial practice.

# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# TODO: import all neural network related api under this directory,
# including layers, linear, conv, rnn etc.
from .activation import (
celu,
elu,
elu_,
gelu,
glu,
gumbel_softmax,
hardshrink,
hardsigmoid,
hardswish,
hardtanh,
hardtanh_,
leaky_relu,
leaky_relu_,
log_sigmoid,
log_softmax,
maxout,
mish,
prelu,
relu,
relu6,
relu_,
rrelu,
selu,
sigmoid,
silu,
softmax,
softmax_,
softplus,
softshrink,
softsign,
swiglu,
swish,
tanh,
tanh_,
tanhshrink,
thresholded_relu,
thresholded_relu_,
)
from .common import (
alpha_dropout,
bilinear,
class_center_sample,
cosine_similarity,
dropout,
dropout1d,
dropout2d,
dropout3d,
feature_alpha_dropout,
fold,
interpolate,
label_smooth,
linear,
pad,
unfold,
upsample,
zeropad2d,
)
from .conv import (
conv1d,
conv1d_transpose,
conv2d,
conv2d_transpose,
conv3d,
conv3d_transpose,
)
from .distance import pairwise_distance, pdist # noqa: F401
from .extension import (
diag_embed, # noqa: F401
gather_tree,
sequence_mask,
temporal_shift,
)
from .flash_attention import (
flash_attention_v3_varlen,
flash_attn_qkvpacked,
flash_attn_varlen_qkvpacked,
flashmask_attention,
sdp_kernel, # noqa: F401
)
from .input import (
embedding,
embedding_renorm_, # noqa: F401
one_hot,
)
# Loss functions exposed through paddle.nn.functional.
# NOTE: a stray commit-timestamp line embedded in this import list was a
# merge/extraction artifact and has been removed (it was a syntax error).
from .loss import (
    adaptive_log_softmax_with_loss,
    binary_cross_entropy,
    binary_cross_entropy_with_logits,
    cosine_embedding_loss,
    cross_entropy,
    ctc_loss,
    dice_loss,
    gaussian_nll_loss,
    hinge_embedding_loss,
    hsigmoid_loss,
    kl_div,
    l1_loss,
    log_loss,
    margin_cross_entropy,
    margin_ranking_loss,
    mse_loss,
    multi_label_margin_loss,
    multi_label_soft_margin_loss,
    multi_margin_loss,
    nll_loss,
    npair_loss,
    poisson_nll_loss,
    rnnt_loss,
    sigmoid_focal_loss,
    smooth_l1_loss,
    soft_margin_loss,
    softmax_with_cross_entropy,
    square_error_cost,
    triplet_margin_loss,
    triplet_margin_with_distance_loss,
)
from .moe_permute import moe_permute
from .moe_unpermute import moe_unpermute
from .norm import (
batch_norm,
group_norm,
instance_norm,
layer_norm,
local_response_norm,
normalize,
rms_norm,
)
from .pooling import (
adaptive_avg_pool1d,
adaptive_avg_pool2d,
adaptive_avg_pool3d,
adaptive_max_pool1d,
adaptive_max_pool2d,
adaptive_max_pool3d,
avg_pool1d,
avg_pool2d,
avg_pool3d,
fractional_max_pool2d,
fractional_max_pool3d,
lp_pool1d,
lp_pool2d,
max_pool1d,
max_pool2d,
max_pool3d,
max_unpool1d,
max_unpool2d,
max_unpool3d,
)
# Attention APIs: scaled dot-product attention (SDPA) and sparse attention.
# NOTE: commit-message/timestamp lines that had leaked into the file here were
# merge/extraction artifacts and have been removed (they were syntax errors).
from .sdpa import scaled_dot_product_attention
from .sparse_attention import sparse_attention
from .vision import (
affine_grid,
channel_shuffle,
grid_sample,
pixel_shuffle,
pixel_unshuffle,
)
# Backward/PyTorch-compatible aliases for APIs imported above.
# NOTE: commit-message/timestamp lines that had leaked into the file here were
# merge/extraction artifacts and have been removed (they were syntax errors).
logsigmoid = log_sigmoid
conv_transpose1d = conv1d_transpose
conv_transpose2d = conv2d_transpose
conv_transpose3d = conv3d_transpose
__all__ = [
'celu',
'conv1d',
'conv1d_transpose',
'conv2d',
'conv2d_transpose',
'conv3d',
'conv3d_transpose',
'conv_transpose1d',
'conv_transpose2d',
'conv_transpose3d',
'pairwise_distance',
'elu',
'elu_',
'gelu',
'hardshrink',
'hardtanh',
'hardtanh_',
'hardsigmoid',
'hardswish',
'leaky_relu',
'leaky_relu_',
'log_sigmoid',
[API Compatibility] Add paddle.Tensor.clamp_ ,paddle.nn.functional.logsigmoid, paddle.functional.meshgrid, paddle.nn.init.calculate_fan_in_and_fan_out ,paddle.autocast (#76206) * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * sharding stage3 bugfix * support recompute's forward and backward in pipeline mode * [API Compatibility] Add paddle.Tensor.clip_ * Revert "support recompute's forward and backward in pipeline mode" This reverts commit 7fd48d9060b292136bce9cdf79983530d5c5d52f. * Revert "[API Compatibility] Add paddle.Tensor.clip_" This reverts commit 025efc33f3daad27e6b8eda75d032c91c1a7a020. * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast * [API Compatibility] Add clip_、logsigmoid、_calculate_fan_in_and_fan_out、meshgrid、autocast
2025-11-06 14:13:32 +08:00
'logsigmoid',
'maxout',
'prelu',
'relu',
'relu_',
'relu6',
'selu',
'softmax',
'softmax_',
'softplus',
'softshrink',
'softsign',
'sigmoid',
'silu',
'swiglu',
'swish',
'mish',
'tanh',
'tanh_',
'tanhshrink',
'thresholded_relu',
'thresholded_relu_',
'log_softmax',
'glu',
'gumbel_softmax',
'sequence_mask',
'dropout',
'dropout1d',
'dropout2d',
'dropout3d',
'alpha_dropout',
'feature_alpha_dropout',
'label_smooth',
'linear',
'pad',
'zeropad2d',
'unfold',
'interpolate',
'upsample',
'bilinear',
'cosine_similarity',
'avg_pool1d',
'avg_pool2d',
'avg_pool3d',
'lp_pool1d',
'lp_pool2d',
'max_pool1d',
'max_pool2d',
'max_pool3d',
'max_unpool1d',
'max_unpool2d',
'max_unpool3d',
'moe_permute',
'moe_unpermute',
'adaptive_avg_pool1d',
'adaptive_avg_pool2d',
'adaptive_avg_pool3d',
'adaptive_max_pool1d',
'adaptive_max_pool2d',
'adaptive_max_pool3d',
'fractional_max_pool2d',
'fractional_max_pool3d',
'binary_cross_entropy',
'binary_cross_entropy_with_logits',
'cross_entropy',
'dice_loss',
'hsigmoid_loss',
'kl_div',
'l1_loss',
'log_loss',
'mse_loss',
'margin_ranking_loss',
'multi_label_soft_margin_loss',
'nll_loss',
'poisson_nll_loss',
'npair_loss',
'sigmoid_focal_loss',
'smooth_l1_loss',
'softmax_with_cross_entropy',
'margin_cross_entropy',
'square_error_cost',
'ctc_loss',
'rnnt_loss',
'hinge_embedding_loss',
'affine_grid',
'grid_sample',
'local_response_norm',
'pixel_shuffle',
'pixel_unshuffle',
'channel_shuffle',
'embedding',
'gather_tree',
'one_hot',
'normalize',
'temporal_shift',
'batch_norm',
'layer_norm',
'rms_norm',
'instance_norm',
'class_center_sample',
'sparse_attention',
'fold',
'cosine_embedding_loss',
'rrelu',
'triplet_margin_with_distance_loss',
'triplet_margin_loss',
'adaptive_log_softmax_with_loss',
'multi_margin_loss',
2025-07-09 21:26:06 +08:00
'multi_label_margin_loss',
'soft_margin_loss',
'gaussian_nll_loss',
'scaled_dot_product_attention',
'flashmask_attention',
'flash_attn_qkvpacked',
"flash_attention_v3_varlen",
'flash_attn_varlen_qkvpacked',
'group_norm',
]