L16H7's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
ggerganov/llama.cpp
LLM inference in C/C++
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
chroma-core/chroma
the AI-native open-source embedding database
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Kent0n-Li/ChatDoctor
karpathy/makemore
An autoregressive character-level language model for making more things
GMvandeVen/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
dmmiller612/bert-extractive-summarizer
Easy to use extractive text summarization with BERT
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
pytorch-labs/LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
cezannec/capsule_net_pytorch
Readable implementation of a Capsule Network as described in "Dynamic Routing Between Capsules" [Hinton et. al.]
proroklab/VectorizedMultiAgentSimulator
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
adamkarvonen/chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.
corl-team/xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
dhruvramani/Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
Reytuag/transformerXL_PPO_JAX
Itomigna2/Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)