owensgroup/ml_perf_model
ML performance model for GPU training of DLRM and more.
Jupyter NotebookBSD-3-Clause
Issues
- 1
Convolution perf model generalizatoin
#10 opened by moderato - 1
Study of overhead stats
#8 opened by moderato - 3
- 2
Multi-node multi-GPU support
#6 opened by moderato - 0
Migrate from nvprof to nsys/ncu
#2 opened by moderato - 0
PyTorch custom operator inputs fix (emb op)
#12 opened by louisfeng - 1
- 1
Get rid of profiler overheads in E2E
#1 opened by moderato - 1
- 1
Code refactor
#5 opened by moderato - 0
Integrate with PARAM
#3 opened by moderato