Pinned Repositories
llama.cpp
LLM inference in C/C++
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
book-tw
Rust 程式設計語言(正體中文翻譯)
GaLore
GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
llama.cpp
LLM inference in C/C++
parson
Lightweight JSON library written in C.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
PenutChen's Repositories
PenutChen/book-tw
Rust 程式設計語言(正體中文翻譯)
PenutChen/GaLore
PenutChen/GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
PenutChen/llama.cpp
LLM inference in C/C++
PenutChen/parson
Lightweight JSON library written in C.
PenutChen/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.