wang-benqiang's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
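As a rough illustration of Unsloth's fast fine-tuning path, the sketch below assumes the `FastLanguageModel` API with an example Llama 3.2 checkpoint and illustrative LoRA hyperparameters; it is not the project's canonical recipe.

```python
# Minimal sketch: 4-bit loading plus LoRA adapters with Unsloth.
# The checkpoint name and hyperparameters are illustrative assumptions.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B-Instruct",  # assumed example checkpoint
    max_seq_length=2048,
    load_in_4bit=True,  # quantized weights provide much of the memory saving
)

# Attach LoRA adapters so only a small fraction of parameters is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_alpha=16,
)
```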
afatcoder/LeetcodeTop
A roundup of high-frequency LeetCode problems commonly asked at major internet companies 🔥
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
netease-youdao/QAnything
Question and Answer based on Anything.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
yangjianxin1/Firefly
Firefly: a training toolkit for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
arcee-ai/mergekit
Tools for merging pretrained large language models.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
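For a sense of the serving workflow, here is a minimal sketch using LMDeploy's `pipeline` API; the model ID is an illustrative assumption.

```python
# Minimal sketch of offline inference with LMDeploy's pipeline API.
from lmdeploy import pipeline

pipe = pipeline("internlm/internlm2_5-7b-chat")  # assumed example model ID
responses = pipe(["Summarize what LMDeploy does in one sentence."])
print(responses[0].text)
```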
wenge-research/YAYI
YaYi LLM: secure and reliable proprietary large models for enterprise customers, based on LlaMA 2 & BLOOM series models trained on large-scale Chinese-English multi-domain instruction data, developed by the Wenge (中科闻歌) algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
huggingface/text-embeddings-inference
A blazing-fast inference solution for text embedding models
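Since text-embeddings-inference runs as a standalone server, a client interaction is just an HTTP call; the sketch below assumes a server already running locally on port 8080 and uses its `/embed` route.

```python
# Minimal client sketch against a locally running text-embeddings-inference server.
# The host/port are an assumption; start the server separately (e.g. via its Docker image).
import requests

resp = requests.post(
    "http://localhost:8080/embed",
    json={"inputs": "What is retrieval-augmented generation?"},
)
resp.raise_for_status()
embedding = resp.json()[0]  # one embedding vector per input string
print(len(embedding))
```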
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Filimoa/open-parse
Improved file parsing for LLMs
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
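The point of LoRAX is that each request can name which fine-tuned adapter to decode with; below is a minimal client sketch, assuming a locally running server, a hypothetical adapter ID, and the generate-style request schema.

```python
# Minimal client sketch against a locally running LoRAX server.
# The endpoint, prompt, and adapter_id are illustrative assumptions.
import requests

resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "Translate to French: cheese",
        "parameters": {"max_new_tokens": 32, "adapter_id": "some-user/my-lora"},  # hypothetical adapter
    },
)
resp.raise_for_status()
print(resp.json()["generated_text"])
```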
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
netease-youdao/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Beomi/InfiniTransformer
Unofficial PyTorch/🤗 Transformers (Gemma/Llama3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
InternLM/InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
chen700564/RGB
WangRongsheng/Aurora
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
sunkx109/llama
Inference code for LLaMA models
gameofdimension/vllm-cn
A demonstration of vLLM's impressive performance on Chinese large language models
RUCAIBox/REAR
Implementation of "REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering"
ahazeemi/RevDet
Robust and Memory Efficient Event Detection and Tracking in Large News Feeds
Darrenzeng/MoE_Train
A customized build of the qwen_moe architecture, with training and fine-tuning implemented
ericzhou571/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
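As a minimal sketch of vLLM's offline batch-inference API; the model name and prompts are illustrative assumptions.

```python
# Minimal sketch of vLLM's offline batch inference API.
# The model name and prompts are illustrative assumptions.
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-7B-Instruct")  # assumed example model
params = SamplingParams(temperature=0.8, max_tokens=64)
outputs = llm.generate(
    ["What is PagedAttention?", "Name three uses of LoRA adapters."],
    params,
)
for out in outputs:
    print(out.outputs[0].text)
```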