COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers

for Deep Neural Networks via Communication Reduction

Implementation based on K-FAC pytorch (https://github.com/gpauloski/kfac-pytorch)

Artifact for PPoPP'25 Paper