Issues
- 2
Monitor GPU poll is bad
#329 opened by Delaunay - 1
Compute model floating-point operations and estimate achieved vs. theoretical FLOPS
#320 opened by Delaunay - 1
Document need for huggingface token
#326 opened by csubich - 0
Add Profiling Option
#317 opened by Delaunay - 0
NGC versus 2.5
#321 opened by Delaunay - 0
Perf evolution from 2.4 to 2.5
#318 opened by Delaunay - 0
Improve reproduction steps
#319 opened by Delaunay - 2
[HPU] llava-single
#300 opened by Delaunay - 2
[HPU] llm-full-mp-gpus
#299 opened by Delaunay - 1
[HPU] llm-lora-ddp-gpus
#298 opened by Delaunay - 0
[HPU] llm-lora-mp-gpus
#305 opened by Delaunay - 0
[HPU] torch geometric
#306 opened by Delaunay - 1
[HPU] reformer
#301 opened by Delaunay - 4
[HPU] rlhf
#303 opened by Delaunay - 0
[rocm] amdsmi not found
#256 opened by Delaunay - 0
[rocm] rlhf
#304 opened by Delaunay - 0
Torch Geometric on ROCm
#296 opened by Delaunay - 0
Jax & CUDA
#258 opened by Delaunay - 2
rank 0 is not on the local node
#282 opened by Delaunay - 1
llm-lora-single on HPU
#297 opened by Delaunay - 5
vjepa-single HPU
#294 opened by Delaunay - 1
Jax on ROCm GPUs
#295 opened by Delaunay - 6
Use llama3 for inference
#255 opened by Delaunay - 0
Milabench Size
#254 opened by Delaunay - 0
light docker image
#244 opened by Delaunay - 0
Data pack redistribution
#239 opened by Delaunay - 5
ptera crashes on Grace Hopper
#216 opened by Delaunay - 1
Install & Prepare should work without GPU
#226 opened by Delaunay - 0
Document batch resizing
#211 opened by Delaunay - 3
v0.0.7 master branch: local variable 'timeout_task' referenced before assignment
#164 opened by luop0812 - 4
NonMatchingSplitsSizesError on Prepare Step
#184 opened by ArjunRamaswami22 - 4
Failures when running on a 4 GPU server
#183 opened by guylaporte - 3
CUDA out of memory
#192 opened by lcebaman - 0
Validation Layers
#57 opened by Delaunay - 0
`milabench --help` raise
#103 opened by Delaunay - 1
AMD: Use all available GPUs for multi-gpu tests
#46 opened by Delaunay - 1
RL benchmarks download Atari ROMs from a website that is often blocked by company firewalls
#69 opened by Delaunay - 1
vit_l_32 fails on H100 with pytorch nightly
#70 opened by Delaunay - 18
Error running milabench with H100 GPU
#44 opened by guylaporte - 0
Investigate Batch size scaling in DDP setups
#60 opened by Delaunay - 1
Add a flag to make milabench exit with -1 so tests fail when something is wrong
#54 opened by Delaunay - 5
milabench install requires GPU
#42 opened by Delaunay - 5
Problem installing milabench
#41 opened by guylaporte - 2
MILABENCH_GPU_ARCH undocumented
#39 opened by gravitino - 2
Use NVML instead of parsing nvidia-smi output
#38 opened by gravitino - 1