Pinned Repositories
3DHumanPose
17-joint 3D Human Pose estimation from Single RGB Images
acme
A library of reinforcement learning components and agents
connected-components-3d
26, 18, and 6 Connected Multi-Label Connected Components on 3D Images
diffusion-relative-rewards
Code for the 2023 NeurIPS paper "Extracting Reward Functions from Diffusion Models"
HelloFresh
Code for the 2024 ACL paper "HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits".
Illu-Attacks-Jax
Code for the ICLR 2024 Paper "Illusory Attacks: Information-theoretic detectability matters in adversarial attacks"
mpc_climate_control
Model Predictive Control Algorithm for temperature regulation of a delivery truck
SelectToPerfect
Code for the 2024 ICLR paper "Select to Perfect: Imitating desired behavior from large multi-agent data"
skeletons
Skeleton generation for neural circuits.
UDIL
fratim's Repositories
fratim/SelectToPerfect
Code for the 2024 ICLR paper "Select to Perfect: Imitating desired behavior from large multi-agent data"
fratim/Illu-Attacks-Jax
Code for the ICLR 2024 Paper "Illusory Attacks: Information-theoretic detectability matters in adversarial attacks"
fratim/UDIL
fratim/acme
A library of reinforcement learning components and agents
fratim/auto-attack
Code for "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
fratim/diffusion-relative-rewards
Code for the 2023 NeurIPS paper "Extracting Reward Functions from Diffusion Models"
fratim/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
fratim/HelloFresh
Code for the 2024 ACL paper "HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits".
fratim/rl-starter-files
RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
fratim/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
fratim/B16Examples
fratim/gym-multigrid
Lightweight multi-agent gridworld Gym environment
fratim/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
fratim/Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
fratim/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
fratim/lbf
A multi-agent environment for RL
fratim/MADDPG
PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
fratim/minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
fratim/minerl2020_submission
fratim/minerl_singularity
fratim/nips_figures
fratim/PettingZoo
Gym for multi-agent reinforcement learning
fratim/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
fratim/Pyro4
Pyro 4.x - Python remote objects
fratim/seals
Benchmark environments for reward modelling and imitation learning algorithms.
fratim/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
fratim/udil-code
fratim/vfunctions
Value Functions
fratim/website_simple
A beautiful, simple, clean, and responsive Jekyll theme for academics
fratim/yaspi
yaspi - Yet Another Slurm Python Interface