leeyusheng's Stars
DefTruth/CUDA-Learn-Notes
🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
mdnice/markdown-resume
:necktie: An online resume typesetting tool that supports Markdown and rich text
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
liguodongiot/llm-action
This project aims to share the technical principles behind large language models along with hands-on experience.
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
yzfly/Awesome-AGI-Agents
🤖 Awesome list of AGI Agents. A curated collection of agent resources.
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
bbycroft/llm-viz
3D Visualization of a GPT-style LLM
greatghoul/remote-working
A collection of curated resources about remote work
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
fr0gger/Awesome-GPT-Agents
A curated list of GPT agents for cybersecurity
SurviveSJTU/SurviveSJTUManual
An update of the 2008 edition of the "Shanghai Jiao Tong University Survival Manual". The GitBook is published at https://survivesjtu.gitbook.io/survivesjtumanual/
km1994/LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
microsoft/autogen
A programming framework for agentic AI 🤖
premAI-io/state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
shiyemin/light-hf-proxy
A light proxy solution for HuggingFace hub.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
mohamed-chs/chatgpt-history-export-to-md
A script to effortlessly extract your entire ChatGPT data export from JSON files to nicely-formatted markdown files.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
JushBJJ/Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.