Pinned Repositories
nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
avellaneda-stoikov
Avellaneda-Stoikov HFT market making algorithm implementation
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
maddpg-rllib
MADDPG in Ray/RLlib
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
questdb
An open source time-series database for fast ingest and SQL queries
restorePhotos
Restoring old and blurry face photos with AI.
sanic
Async Python 3.7+ web server/framework | Build fast. Run fast.
spinningup
An educational resource to help anyone learn deep reinforcement learning.
langchain
🦜🔗 Build context-aware reasoning applications
sanjeevanahilan's Repositories
sanjeevanahilan/nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
sanjeevanahilan/questdb
An open source time-series database for fast ingest and SQL queries
sanjeevanahilan/restorePhotos
Restoring old and blurry face photos with AI.
sanjeevanahilan/avellaneda-stoikov
Avellaneda-Stoikov HFT market making algorithm implementation
sanjeevanahilan/sanic
Async Python 3.7+ web server/framework | Build fast. Run fast.
sanjeevanahilan/spinningup
An educational resource to help anyone learn deep reinforcement learning.
sanjeevanahilan/maddpg-rllib
MADDPG in Ray/RLlib
sanjeevanahilan/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
sanjeevanahilan/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms