Pinned Repositories
Cache
Development based on llama.cpp
Cryptography
my class
Distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
llamacpp_cluster
Focuses on MoE (Mixture of Experts) cluster inference in C/C++
VLSI_testing
NYCU_class
ZJkyle's Repositories
ZJkyle/Cache
Development based on llama.cpp
ZJkyle/Cryptography
my class
ZJkyle/Distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
ZJkyle/llamacpp_cluster
Focuses on MoE (Mixture of Experts) cluster inference in C/C++
ZJkyle/VLSI_testing
NYCU_class