YongGuCheng
Senior Researcher and Team Leader. Specialities: Programming (Python, C/C++), Math (optimization). PhD (TU Darmstadt), MPhil (HKUST), B.Eng. (ZJU)
WeBankShenzhen, PR China
Pinned Repositories
AdaBound
An optimizer that trains as fast as Adam and as good as SGD.
CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python and OpenCL.
Deep-Learning-for-Tracking-and-Detection
Collection of papers and other resources for object tracking and detection using deep learning
Detectron
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
dlib
A toolkit for making real world machine learning and data analysis applications in C++
FATE
An Industrial Level Federated Learning Framework for the Federated AI ecosystem
federated
A framework for implementing federated learning
federated-learning-with-grpc-docker
A simple application that uses docker and gRPC to demonstrate fedrated learning
fedlearn-algo
Fedlearn支持前沿算法研发的Python工具库 | Fedlearn algorithm toolkit for researchers
Reinforcement-Learning-for-A-Stock-Quant-Investiment
Reinforcement Learning Procedure for Quant Investment in Chinese A-Stock Market
YongGuCheng's Repositories
YongGuCheng/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
YongGuCheng/Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
YongGuCheng/calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
YongGuCheng/ControlNet
Let us control diffusion models!
YongGuCheng/DeepLearningSystem
Deep Learning System core principles introduction.
YongGuCheng/DeepSpeedExamples
Example models using DeepSpeed
YongGuCheng/Efficient-LLMs-Survey
Efficient Large Language Models: A Survey
YongGuCheng/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
YongGuCheng/FastCkpt
Python package for rematerialization-aware gradient checkpointing
YongGuCheng/Flowise
Drag & drop UI to build your customized LLM flow
YongGuCheng/langflow
⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
YongGuCheng/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
YongGuCheng/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
YongGuCheng/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
YongGuCheng/Llama2-Chinese
Llama中文社区,最好的中文Llama大模型,完全开源可商用
YongGuCheng/llm-action
LLM 实战
YongGuCheng/llm-inference-benchmark
LLM Inference benchmark
YongGuCheng/llm-numbers
Numbers every LLM developer should know
YongGuCheng/llm-resource
LLM全栈优质资源汇总
YongGuCheng/LLMs-In-China
**大模型
YongGuCheng/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
YongGuCheng/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
YongGuCheng/LocalAI
:robot: Self-hosted, community-driven, local OpenAI compatible API. Drop-in replacement for OpenAI running LLMs on consumer-grade hardware. Free Open Source OpenAI alternative. No GPU required. Runs ggml, GPTQ, onnx, TF compatible models: llama, gpt4all, rwkv, whisper, vicuna, koala, gpt4all-j, cerebras, falcon, dolly, starcoder, and many others
YongGuCheng/NeMo
NeMo: a toolkit for conversational AI
YongGuCheng/OpenRLHF
A Ray-based High-performance RLHF framework (for 34b+ models)
YongGuCheng/question-answering-large-documents
YongGuCheng/starcoder
Home of StarCoder: fine-tuning & inference!
YongGuCheng/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
YongGuCheng/text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA.
YongGuCheng/vision_transformer