Issues
- 4
Is this implementation of the kfac algorithm compatible with the nn module which is not a distributed module?
#150 opened by zhangylch - 1
Adaptive damping
#113 opened by phuvm72 - 3
Using K-FAC training BERT-Large model
#117 opened by RuidongYan666 - 1
- 2
This code uses the empirical fisher matrix instead of actual fisher matrix.
#109 opened by siddharth9820 - 1
Create wiki
#39 opened by gpauloski - 0
Use ncclAvg reduce op instead of future hook that divide tensor by world size
#34 opened by gpauloski - 0
Add Apex as optional requirement
#51 opened by gpauloski - 1
Use allreduce_coalesced for factor allreduce
#35 opened by gpauloski - 2
- 2
The problem of this new edition project
#86 opened by Elec-coder - 0
Add `torch` as mypy additional dependency
#59 opened by gpauloski - 0
Regression in Layer Registration Recursion
#53 opened by gpauloski - 2
Running problems
#57 opened by Elec-coder - 3
Running Problems
#54 opened by Elec-coder - 0
Print env info in example scripts
#44 opened by gpauloski - 0
Remove tags in titles in issue templates
#45 opened by gpauloski - 0
Add BatchNorm support
#23 opened by gpauloski - 0
Add diagonal approximation for linear layers
#26 opened by gpauloski - 0
Add Tensor Communication Bucketing
#32 opened by gpauloski - 0
- 1
Add gate support to linear multi layers
#24 opened by gpauloski - 1
- 3
When i run the example tran_imagenet_resnet.py by pytorch, dist.broadcast: RuntimeError: Tensors must be contiguous
#31 opened by JLtwoP - 0
Test KFAC load/save state dict
#27 opened by gpauloski - 1
Investigate embedding layer performance.
#25 opened by gpauloski - 1
Support factor_decay=1
#20 opened by gpauloski - 1
- 1
- 0
Test broadcast eigens instead of allreduce(sum)
#14 opened by gpauloski - 0
- 1
Add gradient accumulation to imagenet example
#22 opened by gpauloski - 0
add lr tensorboard logging to examples
#8 opened by gpauloski - 0
- 0
Add damping decay
#15 opened by gpauloski - 0
Add kfac update freq decay
#16 opened by gpauloski - 1
Remove redundant communication for kFAC supported layers when reducing non-KFAC supported layers
#9 opened by gpauloski - 1
- 1
Check if all KFAC tensors in cuda mem
#4 opened by gpauloski - 0
- 0
Fix training scripts
#3 opened by gpauloski - 0
Cite repos I found resnet models from
#1 opened by gpauloski