# Copyright (c) 2022 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

from __future__ import annotations

from typing import TYPE_CHECKING, Any

import paddle
from paddle.distributed.communication import stream
from paddle.distributed.communication.group import (
_get_global_group,
_warn_cur_rank_not_in_group,
)
from paddle.distributed.communication.serialization_utils import (
convert_object_to_tensor,
)

if TYPE_CHECKING:
    from paddle import Tensor
    from paddle.base.core import task
    from paddle.distributed.communication.group import Group


def send(
tensor: Tensor,
dst: int = 0,
group: Group | None = None,
sync_op: bool = True,
) -> task | None:
"""
    Send a tensor to the receiver.

    Args:
        tensor (Tensor): The Tensor to send. Its data type
            should be float16, float32, float64, int32, int64, int8, uint8, bool or bfloat16.
        dst (int, optional): The destination rank id. Default: 0.
        group (Group, optional): The group instance returned by ``new_group`` or None for the global default group. Default: None.
        sync_op (bool, optional): Whether this op is a sync op. Default: True.

    Returns:
        A task object, or None if the current rank is not in ``group``.

    Examples:
        .. code-block:: pycon

            >>> # doctest: +REQUIRES(env: DISTRIBUTED)
            >>> import paddle
            >>> import paddle.distributed as dist
            >>> dist.init_parallel_env()
            >>> if dist.get_rank() == 0:
            ...     data = paddle.to_tensor([7, 8, 9])
            ...     dist.send(data, dst=1)
            >>> else:
            ...     data = paddle.to_tensor([1, 2, 3])
            ...     dist.recv(data, src=0)
            >>> print(data)
            >>> # [7, 8, 9] (2 GPUs)
    """
return stream.send(
tensor, dst=dst, group=group, sync_op=sync_op, use_calc_stream=False
)
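

# A minimal sketch (illustrative only, not part of this module) of sending
# within a custom communication group via the ``group`` argument.
# ``dist.new_group`` is Paddle's API for creating a group; the ranks and
# tensor values below are assumptions for the example.
#
#     import paddle
#     import paddle.distributed as dist
#
#     dist.init_parallel_env()
#     sub_group = dist.new_group([0, 1])  # ranks 0 and 1 form their own group
#     if dist.get_rank() == 0:
#         data = paddle.to_tensor([7, 8, 9])
#         dist.send(data, dst=1, group=sub_group)
#     elif dist.get_rank() == 1:
#         data = paddle.to_tensor([0, 0, 0])
#         dist.recv(data, src=0, group=sub_group)
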
def isend(tensor: Tensor, dst: int, group: Group | None = None) -> task | None:
"""
    Send a tensor asynchronously.

    Args:
        tensor (Tensor): The Tensor to send. Its data type
            should be float16, float32, float64, int32, int64, int8, uint8, bool or bfloat16.
        dst (int): The destination rank.
        group (Group, optional): The group instance returned by ``new_group`` or None for the global default group. Default: None.

    Returns:
        A task object, or None if the current rank is not in ``group``.

    Warning:
        This API only supports the dygraph mode.

    Examples:
        .. code-block:: pycon

            >>> # doctest: +REQUIRES(env: DISTRIBUTED)
            >>> import paddle
            >>> import paddle.distributed as dist
            >>> dist.init_parallel_env()
            >>> if dist.get_rank() == 0:
            ...     data = paddle.to_tensor([7, 8, 9])
            ...     task = dist.isend(data, dst=1)
            >>> else:
            ...     data = paddle.to_tensor([1, 2, 3])
            ...     task = dist.irecv(data, src=0)
            >>> task.wait()  # type: ignore[union-attr]
            >>> print(data)
            >>> # [7, 8, 9] (2 GPUs)
    """
return send(tensor, dst, group, sync_op=False)
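

# A minimal sketch of overlapping communication with computation using the
# task handle returned by ``isend``/``irecv``. Assumes two ranks and an
# initialized parallel environment; the tensor values are illustrative.
#
#     import paddle
#     import paddle.distributed as dist
#
#     dist.init_parallel_env()
#     if dist.get_rank() == 0:
#         payload = paddle.to_tensor([1.0, 2.0, 3.0])
#         task = dist.isend(payload, dst=1)
#     else:
#         payload = paddle.zeros([3])
#         task = dist.irecv(payload, src=0)
#     # ...unrelated computation can run here while the transfer is in flight...
#     task.wait()  # block until the communication has completed
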
def send_object_list(
object_list: list[Any],
dst: int | None = None,
group: Group | None = None,
dst_in_group: int | None = None,
):
"""
    Send a list of Python objects to the receiver.

    Args:
        object_list (list): The list of Python objects to send.
        dst (int, optional): The destination rank id. Default: 0.
        group (Group, optional): The group instance returned by ``new_group`` or None for the global default group. Default: None.
        dst_in_group (int, optional): The destination rank within the group. Cannot be specified together with ``dst``. Default: None.

    Returns:
        None.

    Examples:
        .. code-block:: pycon

            >>> # doctest: +REQUIRES(env: DISTRIBUTED)
            >>> import paddle
            >>> import paddle.distributed as dist
            >>> dist.init_parallel_env()
            >>> if dist.get_rank() == 0:
            ...     data = ["hello", {"key": 100}, [1, 2, 3]]
            ...     dist.send_object_list(data, dst=1)
            >>> else:
            ...     data = [None] * 3  # type: ignore
            ...     dist.recv_object_list(data, src=0)
            >>> print(data)
            >>> # ["hello", {"key": 100}, [1, 2, 3]] (2 GPUs)
    """
if object_list is None or len(object_list) == 0:
raise ValueError("object_list cannot be None or empty")
group = _get_global_group() if group is None else group
if _warn_cur_rank_not_in_group(group):
return
if dst_in_group is not None:
if dst is not None:
raise ValueError(
"Cannot specify both 'dst' and 'dst_in_group' arguments."
)
dst = group.get_global_rank(dst_in_group)
else:
dst = 0 if dst is None else dst
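    # At this point ``dst`` is a global rank id, whether it was given directly
    # or resolved from ``dst_in_group`` via ``group.get_global_rank``.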
    # Serialize each object into a 1-D tensor of bytes and record its size
    tensor_list, size_list = zip(
        *[convert_object_to_tensor(obj) for obj in object_list]
    )
    size_list_values = [size.item() for size in size_list]

    # Send the byte sizes first so the receiver knows how much data to expect
    object_sizes_tensor = paddle.to_tensor(size_list_values, dtype='int64')
    send(object_sizes_tensor, dst=dst, group=group)

    # Send the serialized objects, concatenated into a single tensor
    if len(tensor_list) == 1:
        object_tensor = tensor_list[0]
    else:
        object_tensor = paddle.concat(tensor_list)
    send(object_tensor, dst=dst, group=group)
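

# A minimal sketch of the receive side of the size-then-data protocol used by
# ``send_object_list`` above: first receive the int64 size header, then one
# concatenated byte tensor, and split it back into per-object chunks. The
# helper ``convert_tensor_to_object`` is assumed to be the inverse of
# ``convert_object_to_tensor``, and ``recv`` stands for
# ``paddle.distributed.recv``; this is an illustration, not the library's
# actual ``recv_object_list`` implementation.
#
#     def _recv_objects_sketch(num_objects, src, group=None):
#         # Receive the byte size of each serialized object
#         sizes = paddle.zeros([num_objects], dtype='int64')
#         recv(sizes, src=src, group=group)
#         # Receive all serialized bytes in one tensor (dtype assumed to
#         # match the sender's serialized tensors)
#         data = paddle.zeros([int(sizes.sum().item())], dtype='uint8')
#         recv(data, src=src, group=group)
#         # Slice out each object's bytes and deserialize
#         objects, offset = [], 0
#         for size in sizes.tolist():
#             objects.append(
#                 convert_tensor_to_object(data[offset : offset + size], size)
#             )
#             offset += size
#         return objects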