YongGuCheng
Senior Researcher and Team Leader. Specialities: Programming (Python, C/C++), Math (optimization). PhD (TU Darmstadt), MPhil (HKUST), B.Eng. (ZJU)
WeBankShenzhen, PR China
YongGuCheng's Stars
JovenChu/embedding_model_test
基于开源embedding模型的中文向量效果测试
ninehills/llm-inference-benchmark
LLM Inference benchmark
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
ChenHsing/Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
liguodongiot/llm-resource
LLM全栈优质资源汇总
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
wgwang/awesome-LLMs-In-China
**大模型
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
RulinShao/FastCkpt
Python package for rematerialization-aware gradient checkpointing
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
google-research/vision_transformer
MrYxJ/calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
ray-project/llm-numbers
Numbers every LLM developer should know
hiyouga/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
mudler/LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
FlowiseAI/Flowise
Drag & drop UI to build your customized LLM flow
aws-samples/question-answering-large-documents
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LlamaFamily/Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
lllyasviel/ControlNet
Let us control diffusion models!
GanymedeNil/document.ai
基于向量数据库与GPT3.5的通用本地知识库方案(A universal local knowledge base solution based on vector database and GPT3.5)
Dao-AILab/flash-attention
Fast and memory-efficient exact attention