Making large AI models cheaper, faster and more accessible
[shardformer] hybridparallelplugin support gradients accumulation. (#5246)
* support gradients acc fix fix fix fix fix fix fix fix fix fix fix fix fix * fix fix * fix fix fix
F
flybird11111 committed
46e091651bfa161672c46de5153f2747921a40cd
Parent: 2a0558d
Committed by GitHub <noreply@github.com>
on 1/17/2024, 7:22:33 AM