Issues
- 8
sanity check with lingua
#50 opened by SeunghyunSEO - 4
- 0
ImportError in latest version
#38 opened by francois-rozet - 3
[bug]: `ImportError`: Cannot import name `DeviceMesh` from `torch.distributed`
#36 opened by saforem2 - 3
Using Shampoo with Accelerate and FSDP
#24 opened by kfirgoldberg - 5
Failed to compute eigendecomposition
#13 opened by aykamko - 2
Empty params in FSDP cause issue
#23 opened by odegeasslbc - 2
ValueError when start_preconditioning_step value=0 and precondition_frequency=1
#30 opened by bregaldo - 1
- 3
pip install-able
#16 opened by windsornguyen - 1
Fails from DeepSpeed
#19 opened by catid-saronic - 1
Typo in Algorithm 2 in the Distributed Shampoo paper: beta1 used instead of beta2
#21 opened by jondeuce - 1
- 2
ModuleNotFoundError: No module named 'optimizer_modules', while trying to import the DistributedShampoo class
#15 opened by ExpressGradient - 2
using shampoo with distributed training
#2 opened by maxmatical - 1
Add setup.py
#5 opened by LouisCastricato - 5
resuming shampoo from checkpoint results in preconditioners with tensors on wrong device
#3 opened by rwightman - 3
Expectations
#1 opened by avinashsai