leeyusheng

leeyusheng's Stars

DefTruth/CUDA-Learn-Notes
🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
Language:Cuda1.2k130
mdnice/markdown-resume
:necktie:支持 Markdown 和富文本的在线简历排版工具
Language:JavaScript1.6k230
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.2k1.1k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Language:HTML9.4k917
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python2.1k206
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
93370
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.3k202
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
17.9k1.4k
yzfly/Awesome-AGI-Agents
🤖 Awesome list of AGI Agents. Agents 精选资源合集.
30622
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python52258
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.6k504
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
Language:TypeScript3.9k425
greatghoul/remote-working
收集整理远程工作相关的资料
Language:Ruby9.9k823
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Language:Python4.8k399
fr0gger/Awesome-GPT-Agents
A curated list of GPT agents for cybersecurity
5.4k599
SurviveSJTU/SurviveSJTUManual
更新2008年版本的《上海交通大学生存手册》gitbook发布于https://survivesjtu.gitbook.io/survivesjtumanual/
3.9k459
km1994/LLMs_interview_notes
该仓库主要记录大模型（LLMs）算法工程师相关的面试题
1.4k99
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language:Jupyter Notebook62.2k31.9k
microsoft/autogen
A programming framework for agentic AI 🤖
Language:Jupyter Notebook31.4k4.6k
premAI-io/state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
Language:TeX1.5k88
shiyemin/light-hf-proxy
A light proxy solution for HuggingFace hub.
Language:Python435
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.6k4.1k
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
28610
mohamed-chs/chatgpt-history-export-to-md
A script to effortlessly extract your entire ChatGPT data export from JSON files to nicely-formatted markdown files.
Language:Python69332
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook11.9k1.7k
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
6.3k383
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python6k1k
JushBJJ/Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
28.6k3.3k
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Language:Python76660
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python34.9k4.1k