epignatelli

Currently PhD candidate in RL at UCL. Previously ML Lead at @BuroHappoldEngineering and RA at @ImperialCollegeLondon

University College London (UCL), London

Pinned Repositories

BHoM
The Buildings and Habitats Core object Model repo
Language: C# 224 23 90147
cardiax
A Python implementation of the Fenton-Karma model using a fourth-order-accurate central finite difference method, an Euler update scheme, and JAX
Language: Jupyter Notebook 4 1 182
discovering-reinforcement-learning-algorithms
A JAX/Stax implementation of the general meta-learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.
Language: Python 20 1 15
helx
Interoperating between (Deep) Reinforcement Learning libraries
Language: Python 10 2 376
human-level-control-through-deep-reinforcement-learning
A JAX/Stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540), pp. 529-533.
Language: Python 7 0 04
navix
Accelerated minigrid environments with JAX
Language: Python 104 1 298
reinforcement-learning-an-introduction
A Python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R. S. Sutton and A. G. Barto.
Language: Jupyter Notebook 18 4 18
scalable-recognition-with-a-vocabulary-tree
A Python implementation of the paper "Scalable Recognition with a Vocabulary Tree, D. Nister, H. Stewenius, 2006"
Language: Jupyter Notebook 15 4 108
bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language: Python 1.5k 60 31182
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language: Python 30.1k 334 5.6k 2.8k

epignatelli's Repositories

epignatelli/cardiax
A Python implementation of the Fenton-Karma model using a fourth-order-accurate central finite difference method, an Euler update scheme, and JAX
Language: Jupyter Notebook 4 1 182
epignatelli/OpenRGB
Language: C++ 1 1 01
epignatelli/aaai-template
LaTeX template for various conferences, as well as a wise man's Overleaf (Overleaf is terrible!)
Language: TeX 0 0
epignatelli/boo
The Boo Programming Language.
epignatelli/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language: Python 0 0
epignatelli/bump-n-tag
A GitHub Action to automatically bump and tag master, on merge, with the latest SemVer-formatted version. Works on any platform.
Language: TypeScript 0 0
epignatelli/cardiax_manuscript
Language: TeX 1 0
epignatelli/curated-rl-bibtex
A maintained collection of curated BibTeX entries for the most relevant reinforcement learning publications
Language: TeX 1 0
epignatelli/deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Language: Python 0 0
epignatelli/epignatelli
1 0
epignatelli/Font-Awesome
The iconic SVG, font, and CSS toolkit
Language: JavaScript 0 0
epignatelli/gitignore
A collection of useful .gitignore templates
0 0
epignatelli/GraphINVENT
Graph neural networks for molecular design.
Language: Python 0 0
epignatelli/gridworld-x
Language: Python 1 0
epignatelli/gyax
GPU-parallelizable Gym environments based on a JAX backend
1 0
epignatelli/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Language: Python 0 0
epignatelli/gym3
Vectorized interface for reinforcement learning environments
Language: Python
epignatelli/optax
Optax is a gradient processing and optimization library for JAX.
Language: Python 0 0
epignatelli/prioritized-experience-replay
Language: Python 1 0
epignatelli/publication
Language: TeX 1 0
epignatelli/pyage2
"Age of Empires II" Learning Environment
Language: Python 0 0
epignatelli/python-producer-consumer
Language: Python 1 0
epignatelli/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python 0 0
epignatelli/setup
Language: Shell 1 0
epignatelli/so-simple-theme
A simple Jekyll theme for words and pictures.
Language: SCSS 0 0
epignatelli/spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
Language: Python 0 0
epignatelli/synthetic-returns-for-long-term-credit-assignment
Language: Python 1 0
epignatelli/the-paper-series
The paper series is a collection of unofficial implementations of renowned deep reinforcement learning algorithms.
1 0
epignatelli/theapa.bst-fixed
1 0
epignatelli/tracking-nonstationarity-via-online-importance-sampling
Language: Jupyter Notebook 1 0