4paradigm/k8s-vgpu-scheduler
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical capacity. It is designed for ease of use of extended device memory for AI workloads.
GoApache-2.0
Pinned issues
Issues
- 2
vgpu-scheduler container kube-scheduler error
#43 opened by G-Kolls - 0
消费级显卡支持么
#42 opened by 13567436138 - 0
支持新版资源调度么,比如resourceClaim方式
#41 opened by 13567436138 - 0
如何在Prometheus里监控gpu的使用情况
#40 opened by efeng-blue - 0
vgpu-device-plugin CreateContainerError
#37 opened by thinkeng - 0
可以支持一下 kubeedge 么
#38 opened by thinkeng - 1
- 3
- 0
- 0
vgpu repo had not found
#32 opened by swimtobird - 1
- 0
failed calling webhook "vgpu.4pd.io"
#29 opened by Tweakzx - 1
- 0
请问下 libvgpu.so 的代码 可以开源么?
#26 opened by zhengborong - 0
run nvidia-smi err in pod
#25 opened by chenyangxueHDU - 1
- 2
- 0
Handle_remap not found handle
#22 opened by RexQian - 0
- 0
how to install in openshift4 ?
#20 opened by fu7100 - 3
can't find function nvmlDeviceGetComputeRunningProcesses_v2 in libnvidia-ml.so.1
#4 opened by haijohn - 2
- 1
我使用的是v0.9.0.0这个版本,build之后,部署为daemon服务到 GPU节点, 报device-split-count等几个参数未定义,去掉这几个参数后,POD可正常在GPU节点running;但看日志找到不到NVML,GPU节点是P100,求联系求指导
#9 opened by AlexPei - 4
切分10份,但是VGPU显存无变化 NVIDIA A100
#19 opened by Dripman - 1
Is there a way to monitor vGPU with DCGM?
#18 opened by rjanovski - 5
分配2张vgpu却只能看到1张
#17 opened by xwhuang0923 - 1
- 1
两张GPU,只识别了一张卡
#16 opened by absolutelyZero - 0
Segmentation fault (core dumped)
#14 opened by bingMillion - 2
can use for 2080Ti/1080Ti ?
#13 opened by sunhao12121 - 3
- 4
- 2
- 3
commited image can not run in another node.
#8 opened by haijohn - 8
- 1
能否增加选择指定GPU切分
#2 opened by GuoYingLong - 0
切分功能不起作用,请求帮助?
#3 opened by absolutelyZero