2019-05-24 13:20:41 +08:00
|
|
|
# Copyright (c) 2019 PaddlePaddle Authors. All Rights Reserved.
|
|
|
|
|
#
|
|
|
|
|
# Licensed under the Apache License, Version 2.0 (the "License");
|
|
|
|
|
# you may not use this file except in compliance with the License.
|
|
|
|
|
# You may obtain a copy of the License at
|
|
|
|
|
#
|
|
|
|
|
# http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
|
#
|
|
|
|
|
# Unless required by applicable law or agreed to in writing, software
|
|
|
|
|
# distributed under the License is distributed on an "AS IS" BASIS,
|
|
|
|
|
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
|
|
|
# See the License for the specific language governing permissions and
|
|
|
|
|
# limitations under the License.
|
|
|
|
|
import os
|
2023-03-02 18:57:07 +08:00
|
|
|
|
2019-05-28 21:31:28 +08:00
|
|
|
from ..framework import Parameter
|
2022-06-05 10:58:58 +08:00
|
|
|
|
2019-05-24 13:20:41 +08:00
|
|
|
# Process-wide parallel-context handle: None until installed exactly once by
# _set_parallel_ctx, then read by the helper functions below.
__parallel_ctx__clz__ = None
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
def _is_data_parallel_mode():
    """Return True when a parallel context is installed and more than one
    trainer process is configured via PADDLE_TRAINERS_NUM."""
    global __parallel_ctx__clz__
    # Without an installed context, data-parallel mode is impossible.
    if __parallel_ctx__clz__ is None:
        return False
    trainer_count = int(os.getenv("PADDLE_TRAINERS_NUM", "1"))
    return trainer_count > 1
|
2019-05-24 13:20:41 +08:00
|
|
|
|
|
|
|
|
|
2020-08-28 14:46:28 +08:00
|
|
|
def _is_parallel_ctx_initialized():
    """Return whether a parallel context has been installed
    (see _set_parallel_ctx)."""
    global __parallel_ctx__clz__
    initialized = __parallel_ctx__clz__ is not None
    return initialized
|
|
|
|
|
|
|
|
|
|
|
2021-12-06 09:01:15 +08:00
|
|
|
def _set_parallel_ctx(ccl_parallel_context):
    """Install the process-wide parallel context object.

    May be called at most once per process; a second call trips the assert.
    """
    global __parallel_ctx__clz__
    assert (
        __parallel_ctx__clz__ is None
    ), "ParallelContext can only be initialized once."
    __parallel_ctx__clz__ = ccl_parallel_context
|
2019-05-24 13:20:41 +08:00
|
|
|
|
|
|
|
|
|
|
|
|
|
def _init_parallel_ctx():
    """Call init() on the installed parallel context.

    Requires that _set_parallel_ctx has already been called.
    """
    global __parallel_ctx__clz__
    assert (
        __parallel_ctx__clz__ is not None
    ), "ParallelContext should be initialized."
    __parallel_ctx__clz__.init()
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
def _broadcast_parameters(parameters):
    """Broadcast every trainable, non-distributed Parameter from rank 0.

    Args:
        parameters: iterable of parameter objects exposing ``is_distributed``
            and ``trainable`` attributes.
    """
    # Imported lazily to avoid a circular import at module load time.
    from ..distributed import broadcast

    for param in parameters:
        # Under model parallelism a parameter may be split across devices;
        # each rank then holds only its shard, so it cannot be broadcast.
        if param.is_distributed:
            continue
        is_trainable_param = isinstance(param, Parameter) and param.trainable
        if is_trainable_param:
            broadcast(param, 0, sync_op=True)
|