kakaobrain/torchgpipe

A GPipe implementation in PyTorch

PythonBSD-3-Clause

Issues

The same batch size, different micro batches, the algorithm effects are inconsistent.
#35 opened 8 months ago by Kurama622
3
To utilized torchgpipe into normal double convolution UNet
#34 opened 2 years ago by yoonguusong
1
CUDA issues
#33 opened 2 years ago by linghan1997
0
RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same
#32 opened 3 years ago by joelrorseth
2
Whether to support multi-machine training ？
#7 opened 5 years ago by yangpc615
4
[Question] Inference time speed up or not?
#31 opened 3 years ago by tuanmanh1410
1
1
#30 opened 3 years ago by xiyiyia
0
Could torchgpipe run on CPU-only machines?
#29 opened 3 years ago by HuYang719
4
About the debugger for running torchgpipe
#26 opened 3 years ago by Real-ZeminJiang
3
Issues on torchgpipe project and paper
#28 opened 3 years ago by xshaun
1
Failed when trying to use auto partition on running resnet101-speed benchmark
#27 opened 4 years ago by GeorgeQ-Q
0
Not sure if the backpropagation here also follows pipeline
#5 opened 4 years ago by lynex
6
[Question] What is the purpose of "always" checkpointing mode?
#24 opened 4 years ago by pritamdamania87
1
Dual-license as BSD3 for PyTorch integration
#20 opened 4 years ago by pritamdamania87
7
torchpipe + megatron
#23 opened 4 years ago by eric-haibin-lin
1
Work with DistributedDataParallel
#16 opened 4 years ago by YHRen
5
Can torchgpipe work with Horovod?
#22 opened 4 years ago by RainFrost1
0
A more memory-efficient implementation
#19 opened 4 years ago by anxuthu
1
Forward in eval() mode.
#17 opened 4 years ago by anxuthu
3
About speedup
#18 opened 4 years ago by anxuthu
2
Why does `Copy` compute gradients in reversed order
#15 opened 4 years ago by MlWoo
6
How did you handle batch norm?
#14 opened 4 years ago by nirandaperera
2
Using GPipe for Hessian Computation
#13 opened 5 years ago by singhsarvagya
10
Checkpoint Issues
#12 opened 5 years ago by vibhatha
11
Benchmark Performance for Baseline vs Pipeline-1
#11 opened 5 years ago by vibhatha
5
Gpipe Benchmark
#10 opened 5 years ago by vibhatha
11
convergence problem
#9 opened 5 years ago by yangpc615
6
KeyError in Stash
#8 opened 5 years ago by wentaozhu
8
Hello, does it work for a detection network with branches?
#6 opened 5 years ago by yangpc615
2
Reproducing GPipe accuracy results
#1 opened 5 years ago by 0xsamgreen
3
Typo in documentation
#4 opened 5 years ago by Kyeongpil
1
Balance Module
#3 opened 5 years ago by 842974287
5
Question about worker thread in GPipe
#2 opened 5 years ago by 842974287
4