arjunchandra
Dad. Expanding collective potential at Brua IO. Lead large model training stability at Graphcore. Examining the AI quest.
Brua IO, Oslo, Norway
Pinned Repositories
accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
arjunchandra.github.io
Log of views and explanations
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
brain-tokyo-workshop
🧠🗼
c-swm
Contrastive Learning of Structured World Models
C51-DDQN-Keras
C51-DDQN in Keras
caffe
Caffe: a fast open framework for deep learning.
DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
wildrl
Scaling RL to go wild.
arjunchandra's Repositories
arjunchandra/wildrl
Scaling RL to go wild.
arjunchandra/accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
arjunchandra/arjunchandra.github.io
Log of views and explanations
arjunchandra/BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
arjunchandra/brain-tokyo-workshop
🧠🗼
arjunchandra/c-swm
Contrastive Learning of Structured World Models
arjunchandra/chat-ui
Open source codebase powering the HuggingChat app
arjunchandra/chatbot-ui
An open source ChatGPT UI.
arjunchandra/continuous-rl
arjunchandra/deep_rl_for_swarms
arjunchandra/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
arjunchandra/DGN
DGN Code
arjunchandra/GraphGPT
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️
arjunchandra/HardRLWithYoutube
TensorFlow implementation of "Playing hard exploration games by watching YouTube"
arjunchandra/jraph
arjunchandra/llama-recipes
Examples and recipes for Llama model
arjunchandra/muzero-general
MuZero
arjunchandra/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
arjunchandra/papergraph
AI/ML citation graph with postgres + graphql
arjunchandra/playground
PlayGround: AI Research into Multi-Agent Learning.
arjunchandra/popart
Poplar Advanced Runtime for the IPU
arjunchandra/powerful-gnns
How Powerful are Graph Neural Networks?
arjunchandra/RL4LMs
A modular RL library to fine-tune language models to human preferences
arjunchandra/rlgraph
RLgraph: Flexible computation graphs for deep reinforcement learning
arjunchandra/sandbox-grounded-qa
A sandbox repo for grounded question answering with Cohere and Google Search
arjunchandra/Softmax-DQN
arjunchandra/text-generation-inference
Large Language Model Text Generation Inference
arjunchandra/trl
Train transformer language models with reinforcement learning.
arjunchandra/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
arjunchandra/v80
Proceedings of ICML 2018