llgithubll's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
meta-llama/llama
Inference code for Llama models
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
chenfei-wu/TaskMatrix
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, VRAM savings, fast training, "infinite" ctx_len, and free sentence embeddings.
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
nebuly-ai/nebuly
The user analytics platform for LLMs
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
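The idea behind low-rank adaptation can be sketched in a few lines: freeze the pretrained weight W and learn only a low-rank update B @ A, so the effective weight is W + (alpha / r) * B @ A. This is a minimal illustrative sketch in NumPy, not the repo's API; all names and shapes here are my own assumptions.

```python
import numpy as np

# Hypothetical layer sizes and LoRA hyperparameters (illustrative only).
d_out, d_in, r, alpha = 8, 8, 2, 4
rng = np.random.default_rng(0)

W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable, small random init
B = np.zeros((d_out, r))               # trainable, zero init: no change at start

def lora_forward(x):
    # x: (batch, d_in) -> (batch, d_out); only B and A would be trained.
    return x @ (W + (alpha / r) * B @ A).T

x = rng.normal(size=(3, d_in))
# With B zero-initialized, the adapted layer reproduces the frozen layer exactly.
assert np.allclose(lora_forward(x), x @ W.T)
```

Because only B and A (2 * r * d parameters instead of d * d) are trained, fine-tuning is fast and the update can be merged back into W for inference.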
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
dalinvip/Awesome-ChatGPT
A curated collection of ChatGPT learning resources, continuously updated.
SurviveSJTU/SurviveSJTUManual
An updated gitbook edition of the 2008 "SJTU Survival Manual" (上海交通大学生存手册), published at https://survivesjtu.gitbook.io/survivesjtumanual/
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
microsoft/DeBERTa
The implementation of DeBERTa
openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
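The sparse attention patterns from that paper restrict each token to a small set of earlier positions (a local window plus a strided set) instead of full causal attention. A rough sketch of one such mask, with my own function and parameter names (a variation on the paper's fixed/strided patterns, not the repo's code):

```python
import numpy as np

def strided_causal_mask(n, stride):
    """Boolean (n, n) mask: token i may attend to token j (j <= i) if j is
    within the last `stride` positions (local) or i - j is a multiple of
    `stride` (strided). Illustrative sketch only."""
    m = np.zeros((n, n), dtype=bool)
    for i in range(n):
        for j in range(i + 1):
            local = (i - j) < stride
            strided = (i - j) % stride == 0
            m[i, j] = local or strided
    return m

mask = strided_causal_mask(8, 4)
# Token 7 attends to itself and to position 3 (strided), but not position 2.
assert mask[7, 7] and mask[7, 3] and not mask[7, 2]
```

Each row has O(stride + n/stride) nonzeros instead of O(n), which is what makes attention over very long sequences tractable.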
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
voidful/TextRL
Implementation of ChatGPT-style RLHF (Reinforcement Learning with Human Feedback) on any generation model in Hugging Face's Transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
codecaution/Awesome-Mixture-of-Experts-Papers
A curated reading list of research on Mixture-of-Experts (MoE).
THU-KEG/EvaluationPapers4ChatGPT
Resource, Evaluation and Detection Papers for ChatGPT
shizhediao/ChatGPTPapers
Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.
KSESEU/LLMPapers
Papers & works on large language models (ChatGPT, GPT-3, Codex, etc.).
microsoft/DialogLM
Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."