L16H7

Singapore

L16H7's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook97.3k 699 8k15.8k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70k 558 4.2k10.1k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python57.6k 460 1325.9k
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Language:Python52.7k 516 4856.9k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
Language:Python41.4k 330 3.7k5.4k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python38.2k 382 3206.1k
chroma-core/chroma
the AI-native open-source embedding database
Language:Rust16.1k 90 1.2k1.4k
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
Language:Python12.1k 116 7821.3k
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python11k 99 8171.1k
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.4k 122 91382
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python6k 38 186676
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Language:MATLAB4.2k 39 0554
Kent0n-Li/ChatDoctor
Language:Python3.5k 54 52406
karpathy/makemore
An autoregressive character-level language model for making more things
Language:Python2.7k 33 11699
GMvandeVen/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
Language:Jupyter Notebook1.6k 28 30315
dmmiller612/bert-extractive-summarizer
Easy to use extractive text summarization with BERT
Language:Python1.4k 25 111309
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python1.4k 8 95309
Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
Language:Python772 24 2070
pytorch-labs/LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
Language:Python478 8 917
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python467 6 2065
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
Language:Python419 39 1536
cezannec/capsule_net_pytorch
Readable implementation of a Capsule Network as described in "Dynamic Routing Between Capsules" [Hinton et. al.]
Language:Jupyter Notebook372 20 3130
proroklab/VectorizedMultiAgentSimulator
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Language:Python362 9 6072
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
Language:Python278 5 624
adamkarvonen/chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.
Language:Python197 3 216
corl-team/xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Language:Python194 9 1515
dhruvramani/Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
Language:Python174 4 423
MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
Language:Python160 4 1419
Reytuag/transformerXL_PPO_JAX
Language:Python70 4 02
Itomigna2/Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
Language:Jupyter Notebook15 2 45

L16H7

L16H7's Stars

langchain-ai/langchain

ggerganov/llama.cpp

labmlai/annotated_deep_learning_paper_implementations

gpt-engineer-org/gpt-engineer

oobabooga/text-generation-webui

karpathy/nanoGPT

chroma-core/chroma

cpacker/MemGPT

Lightning-AI/litgpt

openlm-research/open_llama

vwxyzjn/cleanrl

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

Kent0n-Li/ChatDoctor

karpathy/makemore

GMvandeVen/continual-learning

dmmiller612/bert-extractive-summarizer

marlbenchmark/on-policy

Victorwz/LongMem

pytorch-labs/LeanRL

jannerm/trajectory-transformer

epfml/landmark-attention

cezannec/capsule_net_pytorch

proroklab/VectorizedMultiAgentSimulator

Pints-AI/1.5-Pints

adamkarvonen/chess_llm_interpretability

corl-team/xland-minigrid

dhruvramani/Transformers-RL

MarcoMeter/episodic-transformer-memory-ppo

Reytuag/transformerXL_PPO_JAX

Itomigna2/Muesli-lunarlander