Issues
- 3
[QUESTION] Not supported on A6000?
#46 opened by Zhuohao-Li - 0
- 5
- 6
[QUESTION] Can Gemm_V3 be used in SM80?
#38 opened by ginowu - 0
- 1
[QUESTION]some questions about allgather+gemm
#42 opened by ChrisRanger - 0
[QUESTION] Can flux run on RTX 4090?
#43 opened by qinghon - 8
[BUG] Failing to install byte-flux from pypi
#30 opened by tlrmchlsmth - 2
- 5
- 1
[BUG] Illegal memory with multi-node
#40 opened by YJHMITWEB - 8
[QUESTION] How to use nvshmem?
#33 opened by chenhongyu2048 - 1
[QUESTION]is there a plan to support int8?
#31 opened by Rainlin007 - 2
- 3
- 6
[BUG] `no_nvlink` branch failed to compile
#32 opened by lucifer1004 - 3
[ENHANCEMENT] support for gpu A40
#35 opened by 1926627357 - 8
[QUESTION] The gemm time on GPU of different rank under tp8 is very different , and cause low performance
#36 opened by Rainlin007 - 2
[QUESTION] Why is ring mode fixed to `All2All` in `src/all_gather/ths_op/all_gather_types.h`?
#37 opened by lucifer1004 - 5
- 1
[QUESTION]How to run examples in pynvshmem
#29 opened by TonyWu199 - 15
- 2
- 14
[BUG] RuntimeError: Could not retrieve or create the backend 2 for device type cuda
#11 opened by tlrmchlsmth - 9
[BUG] gemm and reduce-scatter are not overlapped
#7 opened by wenscarl - 4
[BUG] Illegal memory access in GemmRS when passing fuse_reduction=True and dtype=bfloat16
#8 opened by tlrmchlsmth