skandermoalla

PhD in Machine Learning at EPFL. Deep Reinforcement Learning.

@CLAIRE-Labo EPFLLausanne

Pinned Repositories

no-representation-no-trust
Codebase to fully reproduce the results of "No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO" (Moalla et al. 2024). Uses TorchRL and provides extensive tools for studying representation dynamics in policy optimization.
Language:Jupyter Notebook11 2 01
python-ml-research-template
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust
Language:Shell55 3 73
smacv2
Language:Python184 5 3027
bet-reproduction
Reproducibility study of the paper Behavior Transformers: Cloning k modes with one stone.
Language:Python3 1 00
cresset
Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Language:Dockerfile0 1 00
CSE201-Top5
Basketball team Manager game simulation
Language:C++0 2 01
graphrnn
Reproducing GraphRNN
Language:Python0 1 00
pymarl2-smacv2-experiments
SMACv2 experiments.
Language:Python0 1 00
pytoych-benchmark
A toy pytorch benchmark serving as an example project started from the CLAIRE python ML research template.
Language:Shell20
smacv2
New version of SMAC
Language:Python1 1 00

skandermoalla's Repositories

skandermoalla/bet-reproduction
Reproducibility study of the paper Behavior Transformers: Cloning k modes with one stone.
Language:Python3 1 00
skandermoalla/pytoych-benchmark
A toy pytorch benchmark serving as an example project started from the CLAIRE python ML research template.
Language:Shell20
skandermoalla/smacv2
New version of SMAC
Language:Python1 1 00
skandermoalla/cresset
Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Language:Dockerfile0 1 00
skandermoalla/CSE201-Top5
Basketball team Manager game simulation
Language:C++0 2 01
skandermoalla/cvlab-kubernetes-guide
Instructions and utilities for use of EPFL's compute cluster.
Language:Python0 1 00
skandermoalla/epfl-cs433
EPFL Machine Learning Course, Fall 2022
Language:Jupyter Notebook0 1 00
skandermoalla/epic-guide.github.io
Guidebook for IC PhD life at EPFL
Language:HTML0 1 00
skandermoalla/graphrnn
Reproducing GraphRNN
Language:Python0 1 00
skandermoalla/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Language:Python0 0 00
skandermoalla/it-simplex
Language:Jupyter Notebook0 1 00
skandermoalla/MAA313
Language:Jupyter Notebook0 1 00
skandermoalla/pymarl2-smacv2-experiments
SMACv2 experiments.
Language:Python0 1 00
skandermoalla/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Language:Python1 0
skandermoalla/relay-policy-learning
Language:Python1 0
skandermoalla/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language:Python1 0
skandermoalla/simple-framework-of-choice
Language:Jupyter Notebook1 0
skandermoalla/skandermoalla.github.io
Personal website.
Language:JavaScript1 0
skandermoalla/smac
SMAC: The StarCraft Multi-Agent Challenge
Language:Python1 0
skandermoalla/smacv2-oxwhirl
Language:Python1 0
skandermoalla/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
skandermoalla/tensordict
TensorDict is a pytorch dedicated tensor container.
Language:Python1 0
skandermoalla/TorchRL
Provides a development environment to develop on pytorch/tensordict and pytorch/torchrl.
Language:Shell

skandermoalla

Pinned Repositories

no-representation-no-trust

python-ml-research-template

smacv2

bet-reproduction

cresset

CSE201-Top5

graphrnn

pymarl2-smacv2-experiments

pytoych-benchmark

smacv2

skandermoalla's Repositories

skandermoalla/bet-reproduction

skandermoalla/pytoych-benchmark

skandermoalla/smacv2

skandermoalla/cresset

skandermoalla/CSE201-Top5

skandermoalla/cvlab-kubernetes-guide

skandermoalla/epfl-cs433

skandermoalla/epic-guide.github.io

skandermoalla/graphrnn

skandermoalla/Gymnasium

skandermoalla/it-simplex

skandermoalla/MAA313

skandermoalla/pymarl2-smacv2-experiments

skandermoalla/ppo-implementation-details

skandermoalla/relay-policy-learning

skandermoalla/rl

skandermoalla/simple-framework-of-choice

skandermoalla/skandermoalla.github.io

skandermoalla/smac

skandermoalla/smacv2-oxwhirl

skandermoalla/stable-baselines3

skandermoalla/tensordict

skandermoalla/TorchRL