arjunchandra

Dad. Expanding collective potential at Brua IO. Lead large model training stability at Graphcore. Examining the AI quest.

Brua IOOslo, Norway

Pinned Repositories

accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 1 00
arjunchandra.github.io
Log of views and explanations
Language:SCSS0 2 00
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 3 00
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
Language:Python0 1 00
brain-tokyo-workshop
🧠🗼
Language:Python0 1 00
c-swm
Contrastive Learning of Structured World Models
Language:Python0 1 00
C51-DDQN-Keras
C51-DDQN in Keras
Language:Python0 2 00
caffe
Caffe: a fast open framework for deep learning.
Language:C++0 2 00
DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
1 2 00
wildrl
Scaling RL to go wild.
1 2 00

arjunchandra's Repositories

arjunchandra/wildrl
Scaling RL to go wild.
1 2 00
arjunchandra/accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 1 00
arjunchandra/arjunchandra.github.io
Log of views and explanations
Language:SCSS0 2 00
arjunchandra/BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
Language:Python0 1 00
arjunchandra/brain-tokyo-workshop
🧠🗼
Language:Python0 1 00
arjunchandra/c-swm
Contrastive Learning of Structured World Models
Language:Python0 1 00
arjunchandra/chat-ui
Open source codebase powering the HuggingChat app
Language:Svelte0 0
arjunchandra/chatbot-ui
An open source ChatGPT UI.
Language:TypeScript1 0
arjunchandra/continuous-rl
Language:Python1 01
arjunchandra/deep_rl_for_swarms
Language:Python1 0
arjunchandra/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language:Python1 0
arjunchandra/DGN
DGN Code
Language:Python1 0
arjunchandra/GraphGPT
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️
Language:JavaScript1 0
arjunchandra/HardRLWithYoutube
TensorFlow implementation of "Playing hard exploration games by watching YouTube"
Language:Python2 01
arjunchandra/jraph
Language:Python1 0
arjunchandra/llama-recipes
Examples and recipes for Llama model
Language:Python0 0
arjunchandra/muzero-general
MuZero
Language:Python1 0
arjunchandra/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Jupyter Notebook1 0
arjunchandra/papergraph
AI/ML citation graph with postgres + graphql
Language:Jupyter Notebook1 0
arjunchandra/playground
PlayGround: AI Research into Multi-Agent Learning.
Language:Python2 0
arjunchandra/popart
Poplar Advanced Runtime for the IPU
Language:C++1 0
arjunchandra/powerful-gnns
How Powerful are Graph Neural Networks?
Language:Python2 0
arjunchandra/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python1 0
arjunchandra/rlgraph
RLgraph: Flexible computation graphs for deep reinforcement learning
Language:Python2 0
arjunchandra/sandbox-grounded-qa
A sandbox repo for grounded question answering with Cohere and Google Search
Language:Python1 0
arjunchandra/Softmax-DQN
Language:Python1 0
arjunchandra/text-generation-inference
Large Language Model Text Generation Inference
Language:Python0 0
arjunchandra/trl
Train transformer language models with reinforcement learning.
Language:Python1 0
arjunchandra/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python1 0
arjunchandra/v80
Proceedings of ICML 2018
Language:TeX2 0