DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Add flops profiler tutorial (#682)
* work on flops profiler tutorial * update flops profiler tutorial * add flops profiler tutorial and fix names * work on flops profiler tutorial * update flops profiler tutorial * add flops profiler tutorial and fix names * fix tailing ws * fix names * remove multistep profiling and update docs * fix cases where functionals and submodules coexist in a parent module, update readme * fix typo * always invoke post hook function * fix module flops sum and update tests * update tutorial
C
Cheng Li committed
e2dfe0d17be21ce08e759dab4712bc37debd44c4
Parent: 6ee3b29
Committed by GitHub <noreply@github.com>
on 2/11/2021, 2:03:55 AM