amuni3's Stars
hamishs/JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
amzn/auction-gym
AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.
google-deepmind/opro
Official code for "Large Language Models as Optimizers"
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
federicovergallo/SUMO-changing-lane-agent
Implementation of a reinforcement learning agent that performs autonomous lane changes using SUMO
awjuliani/successor_examples
Tutorials on learning and using successor representations.
KaiYan289/RLpapersnote
Toni-SM/skrl
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
haarnoja/softqlearning
Reinforcement Learning with Deep Energy-Based Policies
Neo2308/MellowMax-RL
Cloud0723/Offline-MLIRL
jxzhangjhu/awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
Div99/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
gucino/learning-Racetrack-environment-using-First-Visit-Monte-Carlo-SARSA-and-Q-Learning
vojtamolda/reinforcement-learning-an-introduction
Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
sebjai/robust-risk-aware-rl
Some implementations from the paper "Robust Risk-Aware Reinforcement Learning"
leopard-ai/betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
lezcano/geotorch
Constrained optimization toolkit for PyTorch
benchopt/benchmark_bilevel
Benchmark for bi-level optimization solvers
crowsonkb/mdmm-jax
Gradient-based constrained optimization for JAX
kenjyoung/MinAtar
clementsw/risk-and-uncertainty
Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"
LucasCJYSDL/DGMs-for-Offline-Policy-Learning
This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning. We cover multiple deep generative models, including VAEs, GANs, Normalizing Flows, Transformers, and Diffusion Models.
amuni3/WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
zaiyan-x/RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
kirui93/MasterThesis
The files in this repository were used to implement the three models in my master's thesis, titled "Comparison of different portfolio optimization problems with different risk measures".
NrLabFreiburg/inverse-q-learning
ThibautTheate/Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning
Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional Reinforcement Learning".
cvxgrp/cvxpylayers
Differentiable convex optimization layers