amuni3

amuni3's Stars

hamishs/JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
Language:Python193
amzn/auction-gym
AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.
Language:Jupyter Notebook14437
google-deepmind/opro
official code for "Large Language Models as Optimizers"
Language:Python39238
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
Language:Python2.6k739
federicovergallo/SUMO-changing-lane-agent
Implementation of a reinforcement learning agent able to do autonomous changing lane using Sumo
Language:Python6516
awjuliani/successor_examples
Tutorials on learning and using successor representations.
Language:Jupyter Notebook5014
KaiYan289/RLpapersnote
382
Toni-SM/skrl
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
Language:Python51847
haarnoja/softqlearning
Reinforcement Learning with Deep Energy-Based Policies
Language:Python41194
Neo2308/MellowMax-RL
Language:Jupyter Notebook1
Cloud0723/Offline-MLIRL
Language:Python92
jxzhangjhu/awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
161
Div99/IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
Language:Python19831
gucino/learning-Racetrack-environment-using-First-Visit-Monte-Carlo-SARSA-and-Q-Learning
Language:Python52
vojtamolda/reinforcement-learning-an-introduction
Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
Language:Jupyter Notebook32974
sebjai/robust-risk-aware-rl
Some implementations from the paper robust risk aware reinforcement learning
Language:Jupyter Notebook3313
leopard-ai/betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
Language:Python32927
lezcano/geotorch
Constrained optimization toolkit for PyTorch
Language:Python64934
benchopt/benchmark_bilevel
Benchmark for bi-level optimization solvers
Language:Python346
crowsonkb/mdmm-jax
Gradient-based constrained optimization for JAX
Language:Python25
kenjyoung/MinAtar
Language:Python28256
clementsw/risk-and-uncertainty
Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"
1
LucasCJYSDL/DGMs-for-Offline-Policy-Learning
This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning. We cover multiple deep generative models, including VAEs, GANs, Normalizing Flows, Transformers, and Diffusion Models.
262
amuni3/WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
1
zaiyan-x/RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
Language:Python223
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
89648
kirui93/MasterThesis
The files contained in this repository were used for implementing the three models in my master thesis. My master thesis title is "Comparison of different portfolio optimization problems with different risk measures".
Language:Jupyter Notebook31
NrLabFreiburg/inverse-q-learning
Language:Python134
ThibautTheate/Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning
Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional Reinforcement Learning".
Language:Python15
cvxgrp/cvxpylayers
Differentiable convex optimization layers
Language:Python1.8k159

amuni3

amuni3's Stars

hamishs/JAX-RL

amzn/auction-gym

google-deepmind/opro

Farama-Foundation/HighwayEnv

federicovergallo/SUMO-changing-lane-agent

awjuliani/successor_examples

KaiYan289/RLpapersnote

Toni-SM/skrl

haarnoja/softqlearning

Neo2308/MellowMax-RL

Cloud0723/Offline-MLIRL

jxzhangjhu/awesome-LLM-controlled-decoding-generation

Div99/IQ-Learn

gucino/learning-Racetrack-environment-using-First-Visit-Monte-Carlo-SARSA-and-Q-Learning

vojtamolda/reinforcement-learning-an-introduction

sebjai/robust-risk-aware-rl

leopard-ai/betty

lezcano/geotorch

benchopt/benchmark_bilevel

crowsonkb/mdmm-jax

kenjyoung/MinAtar

clementsw/risk-and-uncertainty

LucasCJYSDL/DGMs-for-Offline-Policy-Learning

amuni3/WCSAC

zaiyan-x/RFQI

zchoi/Awesome-Embodied-Agent-with-LLMs

kirui93/MasterThesis

NrLabFreiburg/inverse-q-learning

ThibautTheate/Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning

cvxgrp/cvxpylayers