evilsocket/cake
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Language: Rust. License: NOASSERTION.
Issues
- build error on compiling with cuda features (#29, opened by venusarathy, 0 comments)
- request for the second time error details (#26, opened by ddbbcc, 0 comments)
- May I ask why I am unable to download the model and use the product through Huggingface (#23, opened by Caesarleee, 6 comments)
- Is it possible to use quantized models? (#22, opened by ManuXD32, 3 comments)
- Standardized Android client (#14, opened by TriDefender, 2 comments)
- Dockerfile support (#12, opened by James4Ever0, 3 comments)
- Req Support for Llama 3.1 (#21, opened by jkfnc, 5 comments)
- bug with tokenizer and gibberish output (#9, opened by evilsocket, 2 comments)
- Thanks for the FOSS! Suggestion for future possible backends runtimes: Vulkan, OpenCL, SYCL/OpenVino/intel GPU, AMD gpu/ROCm/HIP. (#20, opened by ghchris2021, 2 comments)
- Unable to build without CUDA (#16, opened by damnkrat, 1 comment)
- Use hf_hub crate to pull model (#15, opened by b0xtch, 3 comments)
- About the reason of having cluster nodes (#10, opened by hafezmg48, 4 comments)
- PTX code was compiled with an unsupported toolchain (#11, opened by JKYtydt, 9 comments)
- Cross-device mapping (#2, opened by b0xtch, 3 comments)