Issues
rank 0 is not on the local node
#282 opened by Delaunay - 6
Jax & CUDA
#258 opened by Delaunay - 0
amdsmi not found
#256 opened by Delaunay - 0
Use llama3 for inference
#255 opened by Delaunay - 0
Milabench Size
#254 opened by Delaunay - 0
light docker image
#244 opened by Delaunay - 0
Data pack redistribution
#239 opened by Delaunay - 5
ptera crashes on Grace Hopper
#216 opened by Delaunay - 1
Install & Prepare should work without GPU
#226 opened by Delaunay - 0
Document batch resizing
#211 opened by Delaunay - 3
v0.0.7 master branch: local variable 'timeout_task' referenced before assignment
#164 opened by luop0812 - 4
NonMatchingSplitsSizesError on Prepare Step
#184 opened by ArjunRamaswami22 - 4
Failures when running on a 4 GPU server
#183 opened by guylaporte - 3
CUDA out of memory
#192 opened by lcebaman - 0
Validation Layers
#57 opened by Delaunay - 0
`milabench --help` raise
#103 opened by Delaunay - 1
AMD: Use all available GPUs for multi-gpu tests
#46 opened by Delaunay - 1
RL benchmarks download Atari ROMs from a website that is often blocked by company firewalls
#69 opened by Delaunay - 1
vit_l_32 fails on H100 with PyTorch nightly
#70 opened by Delaunay - 6
use https instead of git uris
#4 opened by tbugfinder - 2
move anaconda installation
#5 opened by tbugfinder - 1
scaling benchmark fails
#1 opened by nileshnegi - 18
Error running milabench with H100 GPU
#44 opened by guylaporte - 2
Allow to override dependencies at the top level
#20 opened by Delaunay - 0
provide prebuilt docker image
#2 opened by tbugfinder - 0
Investigate Batch size scaling in DDP setups
#60 opened by Delaunay - 1
Add a flag to make milabench exit with -1 so that tests fail when something is wrong
#54 opened by Delaunay - 5
Milabench v2 on AMD
#21 opened by Delaunay - 0
Add AMD GPU monitoring
#22 opened by Delaunay - 3
milabench install requires GPU
#42 opened by Delaunay - 5
Problem installing milabench
#41 opened by guylaporte - 2
MILABENCH_GPU_ARCH undocumented
#39 opened by gravitino - 2
Use NVML instead of parsing nvidia-smi output
#38 opened by gravitino - 1
soft_actor_critic crash because of API change
#28 opened by Delaunay - 0
soft_actor_critic crash because of API change
#27 opened by Delaunay - 3
[low] ptera.selfless.ConflictError: Multiple values with same priority conflict for variable 'base'
#9 opened by lebrice - 1
Create license
#6 opened by fosterrath-mila - 1
build using podman/buildah
#3 opened by tbugfinder