zhanqiuzhang

University of Science and Technology of ChinaHefei, Anhui, China

zhanqiuzhang's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++68.2k 547 4k9.8k
OpenInterpreter/open-interpreter
A natural language interface for computers
Language:Python55.7k 415 9734.8k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python38.8k 385 1.7k4.3k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python37.5k 377 3186k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python32.5k 188 5583.5k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python19.3k 126 5261.9k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.7k 274 121812
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.5k 158 64821
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.7k 94 2k996
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.4k 36 2971k
kyutai-labs/moshi
Language:Python6.8k 78 82533
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.3k 52 1k634
meta-llama/llama-stack
Composable building blocks to build Llama Apps
Language:Python4.6k 130 166588
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
Language:Python3.8k 74 243968
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.6k 44 88372
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python3.2k 27 369202
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3.1k 97 113278
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
2.2k 23 51114
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
Language:Python1.9k 26 34163
ymcui/Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
Language:Python1.7k 20 79146
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.2k 32 8183
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
1.1k 12 425
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
1k 44 637
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Language:Python970 11 41135
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Language:Python893 17 861
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Language:Python715 8 7149
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
573 15 029
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
Language:Jupyter Notebook461 10 1035
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
Language:Python266 5 520
BaichuanSEED/BaichuanSEED.github.io
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline"
Language:JavaScript18 1 00

zhanqiuzhang

zhanqiuzhang's Stars

ggerganov/llama.cpp

OpenInterpreter/open-interpreter

hpcaitech/ColossalAI

karpathy/nanoGPT

2noise/ChatTTS

microsoft/graphrag

BradyFU/Awesome-Multimodal-Large-Language-Models

RUCAIBox/LLMSurvey

NVIDIA/TensorRT-LLM

lucidrains/denoising-diffusion-pytorch

kyutai-labs/moshi

bitsandbytes-foundation/bitsandbytes

meta-llama/llama-stack

attardi/wikiextractor

huggingface/speech-to-speech

QwenLM/Qwen2-VL

gpt-omni/mini-omni

deepseek-ai/DeepSeek-Coder-V2

BAAI-Agents/Cradle

ymcui/Chinese-LLaMA-Alpaca-3

QwenLM/Qwen2-Audio

kvcache-ai/Mooncake

Xnhyacinth/Awesome-LLM-Long-Context-Modeling

parthsarthi03/raptor

tencent-ailab/persona-hub

princeton-nlp/SimPO

Neph0s/awesome-llm-role-playing-with-persona

allenai/OLMoE

Pints-AI/1.5-Pints

BaichuanSEED/BaichuanSEED.github.io