Pinned Repositories
awesome-LLMs-In-China
中国大模型
CardGame
icp game
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FasterTransformer
Transformer related optimization, including BERT, GPT
fastertransformer_backend
GemmaChinese
Chinese Community for Google Gemma LLM
http-oracle
http-oracle
http-oracle-call
http on oracle
llama.cpp
Port of Facebook's LLaMA model in C/C++
llama2-webui
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.
Rayrtfr's Repositories
Rayrtfr/FasterTransformer
Transformer related optimization, including BERT, GPT
Rayrtfr/fastertransformer_backend
Rayrtfr/llama.cpp
Port of Facebook's LLaMA model in C/C++
Rayrtfr/llama2-webui
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.
Rayrtfr/awesome-LLMs-In-China
中国大模型
Rayrtfr/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Rayrtfr/GemmaChinese
Chinese Community for Google Gemma LLM
Rayrtfr/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Rayrtfr/Llama2-Chinese
最好的中文Llama大模型
Rayrtfr/Mixtral-8x7B-Chinese
Mixtral-8x7B中文
Rayrtfr/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs