tencentmusic/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
Jupyter NotebookNOASSERTION
Issues
- 1
- 2
如何接入label studio
#242 opened by U201311 - 1
pipeline和jupyter两个namespace内的pod均提示“stat /data/k8s/kubeflow/pipeline/workspace: no such fire or directory",请问该如何调整呢?现在均卡在ContainerCreating阶段。
#248 opened by chengadmin80 - 2
二次开发
#193 opened by charlesXu86 - 1
关于start.sh
#246 opened by ZhenpengChenCode - 0
管理员账号密码默认是多少呢?
#247 opened by jiayi-1994 - 0
请问链接公众号和企业微信部分的代码在哪里啊?
#245 opened by bigbrother666sh - 0
可以新增多模态信息抽取模型?
#244 opened by tianchiguaixia - 1
- 4
登录界面没有相应代码
#237 opened by 981860146 - 1
单机K8s部署后,cube前端页面过十几分钟就不能访问了,需要重启pod才行。
#197 opened by JankinHou - 0
- 1
单机部署 数据库可以用单独机器的数据库吗
#239 opened by hqfgithub - 0
创建notebook后,点进去404
#241 opened by dataknocker - 0
docker-compose部署单机出现版本问题
#240 opened by msj905 - 2
- 1
如何将Geforce RTX3090GPU 加入平台的K8S集群中?
#236 opened by provenclei - 1
怎么更新已经单机部署好的cube-studio平台
#233 opened by ykx25 - 1
异构算力支持?
#221 opened by jeremyjiao - 1
- 1
私有仓库镜像节点无法运行 failed to look-up entrypoint/cmd for image \"172.40.20.82:8443/aiclube/model_download\"
#229 opened by gfoxlin - 1
单机部署之后访问http:xx.xx.xx.xx 返回rancher web页面咋回事呀,rancher web按照部署手册也是设置的xx.xx.xx.xx
#225 opened by LazyCatLee - 1
项目空间-资源配置模块为disabled状态
#228 opened by realtyz - 2
- 1
启动以后【标注平台】功能入口是disabled的状态
#227 opened by HeavenDuke - 1
external_ip 多ip 逻辑bug
#223 opened by YanSongCode - 1
- 2
torchserve部署服务能否更新镜像
#214 opened by jinghao2eebd - 1
数据集删除遗留问题
#224 opened by taogezhizun - 0
请问是否支持推理服务部署在非k8s集群机器上
#231 opened by super-lkl - 0
aihub运行中存在的前端bug
#187 opened by chendile - 2
数据集 批量导入后报错,显示页面有问题
#204 opened by miracletiger - 1
reset_docker.sh 将所有容器和卷都清除了
#208 opened by danerlt - 1
批量导出报错
#216 opened by InvincibleMinions - 1
dataset模板导入数据提示not found for url
#217 opened by InvincibleMinions - 2
【模型训练】 创建的任务模板无法删除
#222 opened by adgers - 2
- 2
云主机部署无法通过gateway的80端口打开
#215 opened by miracletiger - 2
请问模型训练的结果可视化,类似tensorboard这种的功能打算怎么实现么?
#186 opened by wanyy15083 - 2
kubeflow-dashboard start error
#202 opened by Wanglw6 - 1
- 1
安装问题关于已存在kubeflow prometheus
#205 opened by 631068264 - 1
添加notebook后,平台页面无法打开
#206 opened by skynewborn - 2
启动服务时镜像找不到
#207 opened by zheng1xin - 2
wsl2 部署发现这个问题
#198 opened by allenliuvip - 1
4.1号版本 在线开发生成vs规则后导致整个访问都是404
#196 opened by lkad - 1
Dockerfile-base打包出错
#195 opened by lkad - 2
打开任务模板和任务流报错
#176 opened by ZXTFINAL - 1
路由增加https反代,login跳转返回http的协议
#189 opened by lkad - 0
目前的kubeflow_dashboard的Dockerfile和base的dockerfile有问题
#175 opened by ZXTFINAL