fratim

Pinned Repositories

3DHumanPose
17-joint 3D Human Pose estimation from Single RGB Images
00
acme
A library of reinforcement learning components and agents
Language:Python00
connected-components-3d
26-, 18-, and 6-connected multi-label connected components on 3D images
Language:Python10
diffusion-relative-rewards
Code for the 2023 NeurIPS paper "Extracting Reward Functions from Diffusion Models"
Language:Python00
HelloFresh
Code for the 2024 ACL paper "HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits".
Language:Python00
Illu-Attacks-Jax
Code for the ICLR 2024 Paper "Illusory Attacks: Information-theoretic detectability matters in adversarial attacks"
Language:Python10
mpc_climate_control
Model Predictive Control Algorithm for temperature regulation of a delivery truck
Language:MATLAB10
SelectToPerfect
Code for the 2024 ICLR paper "Select to Perfect: Imitating desired behavior from large multi-agent data"
40
skeletons
Skeleton generation for neural circuits.
Language:C++10
UDIL
Language:HTML10

fratim's Repositories

fratim/SelectToPerfect
Code for the 2024 ICLR paper "Select to Perfect: Imitating desired behavior from large multi-agent data"
40
fratim/Illu-Attacks-Jax
Code for the ICLR 2024 Paper "Illusory Attacks: Information-theoretic detectability matters in adversarial attacks"
Language:Python10
fratim/UDIL
Language:HTML10
fratim/acme
A library of reinforcement learning components and agents
Language:Python00
fratim/auto-attack
Code related to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
Language:Python00
fratim/diffusion-relative-rewards
Code for the 2023 NeurIPS paper "Extracting Reward Functions from Diffusion Models"
Language:Python00
fratim/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Language:Python00
fratim/HelloFresh
Code for the 2024 ACL paper "HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits".
Language:Python00
fratim/rl-starter-files
RL starter files to immediately train, visualize, and evaluate an agent without writing a line of code
Language:Python00
fratim/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Language:Python00
fratim/B16Examples
Language:C++
fratim/gym-multigrid
Lightweight multi-agent gridworld Gym environment
Language:Python
fratim/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language:Python
fratim/Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
fratim/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
fratim/lbf
A multi-agent environment for RL
Language:Python
fratim/MADDPG
PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
fratim/minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
Language:Java
fratim/minerl2020_submission
Language:Python
fratim/minerl_singularity
fratim/nips_figures
Language:Python
fratim/PettingZoo
Gym for multi-agent reinforcement learning
Language:Python
fratim/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language:Python
fratim/Pyro4
Pyro 4.x - Python remote objects
fratim/seals
Benchmark environments for reward modelling and imitation learning algorithms.
fratim/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python
fratim/udil-code
fratim/vfunctions
Value Functions
Language:Python
fratim/website_simple
A beautiful, simple, clean, and responsive Jekyll theme for academics
Language:JavaScript
fratim/yaspi
yaspi - Yet Another Slurm Python Interface