arjunchandra

Dad. Expanding collective potential at Brua IO. Lead large model training stability at Graphcore. Examining the AI quest.

Brua IOOslo, Norway

Pinned Repositories

accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 1 00
arjunchandra.github.io
Log of views and explanations
Language:SCSS0 2 00
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 3 00
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
Language:Python0 1 00
brain-tokyo-workshop
🧠🗼
Language:Python0 1 00
c-swm
Contrastive Learning of Structured World Models
Language:Python0 1 00
C51-DDQN-Keras
C51-DDQN in Keras
Language:Python0 2 00
caffe
Caffe: a fast open framework for deep learning.
Language:C++0 2 00
DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
1 2 00
wildrl
Scaling RL to go wild.
1 2 00

arjunchandra's Repositories

arjunchandra/talk-transcripts
Transcripts of Clojure-related talks
arjunchandra/dlbook_notation
LaTeX files for the Deep Learning book notation
Language:TeX
arjunchandra/turicreate
Turi Create simplifies the development of custom machine learning models.
Language:C++
arjunchandra/paac
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
Language:Python
arjunchandra/C51-DDQN-Keras
C51-DDQN in Keras
Language:Python
arjunchandra/safe_learning
Safe reinforcement learning with stability guarantees
Language:Python
arjunchandra/carnd
Projects during the Self-Driving Car Engineer Nanodegree program from Udacity
Language:Jupyter Notebook
arjunchandra/treeagent
Decision tree ensembles as RL policies
Language:Go
arjunchandra/solar_panels_rl
Repository for code experimenting with RL and Solar Tracking
Language:TeX
arjunchandra/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python
arjunchandra/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
1
arjunchandra/osim-rl
Reinforcement learning environments with musculoskeletal models
Language:Python
arjunchandra/seq2seq-signal-prediction
Signal prediction with a seq2seq RNN model in TensorFlow
Language:Jupyter Notebook
arjunchandra/deep-rl-tensorflow
TensorFlow implementation of Deep Reinforcement Learning papers
Language:Python
arjunchandra/dnc
A TensorFlow implementation of the Differentiable Neural Computer.
Language:Python
arjunchandra/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Language:C++
arjunchandra/qbsolv
qbsolv is a metaheuristic or partitioning solver that solves a potentially large quadratic unconstrained binary optimization (QUBO) problem by splitting it into pieces that are solved either on a D-Wave system or via a classical tabu solver.
Language:C
arjunchandra/cozmo-python-sdk
Anki Cozmo Python SDK
Language:Python
arjunchandra/nips2015_vrnn
Language:Python
arjunchandra/cle
Language:Python
arjunchandra/requests-for-research
A living collection of deep learning problems
Language:HTML
arjunchandra/imitation
Language:Python
arjunchandra/reinforcejs
Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)
Language:HTML
arjunchandra/tensorflow-deepq
A deep Q learning demonstration using Google Tensorflow
Language:Jupyter Notebook
arjunchandra/dockerfiles
Compilation of Dockerfiles with automated builds enabled on the Docker Registry.
Language:Shell
arjunchandra/DeepMind-Atari-Deep-Q-Learner
The original code from the DeepMind article + my tweaks
Language:Lua
arjunchandra/deepframeworks
Evaluation of Deep Learning Frameworks
arjunchandra/veles
Distributed machine learning platform
Language:C++
arjunchandra/caffe
Caffe: a fast open framework for deep learning.
Language:C++
arjunchandra/Theano
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.
Language:Python