ZJkyle

Pinned Repositories

Cache
Development based on llama.cpp
Language:C++00
Cryptography
my class
Language:Python00
Distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
Language:C++00
llamacpp_cluster
Focus on MOE cluster inference in C/C++
Language:C++00
VLSI_testing
NYCU_class
Language:C++00

ZJkyle's Repositories

ZJkyle/Cache
Development based on llama.cpp
Language:C++00
ZJkyle/Cryptography
my class
Language:Python00
ZJkyle/Distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
Language:C++00
ZJkyle/llamacpp_cluster
Focus on MOE cluster inference in C/C++
Language:C++00
ZJkyle/VLSI_testing
NYCU_class
Language:C++00