max-andr

PhD student @ EPFL🇨🇭. Interested in robustness and generalization in LLMs.

EPFL, Lausanne

Pinned Repositories

jailbreakbench
An Open Robustness Benchmark for Jailbreaking Language Models [arXiv 2024]
Language: Python 90 4 111
adversarial-random-search-gpt4
Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]
Language: Jupyter Notebook 39 3 01
joint-cnn-mrf
Implementation of "Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation"
Language: Python 56 3 217
provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
Language: Jupyter Notebook 31 6 26
provably-robust-boosting
Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019]
Language: Python 49 6 211
relu_networks_overconfident
Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem [CVPR 2019, oral]
Language: Python 182 7 421
square-attack
Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]
Language: Python 144 5 1326
robustbench
RobustBench: a standardized adversarial robustness benchmark [NeurIPS'21 Benchmarks and Datasets Track]
Language: Python 610 9 9896
llm-adaptive-attacks
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]
Language: Shell 110 3 37
understanding-fast-adv-training
Understanding and Improving Fast Adversarial Training [NeurIPS 2020]
Language: Python 91 4 712

max-andr's Repositories

max-andr/relu_networks_overconfident
Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem [CVPR 2019, oral]
Language: Python 182 7 421
max-andr/square-attack
Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]
Language: Python 144 5 1326
max-andr/joint-cnn-mrf
Implementation of "Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation"
Language: Python 56 3 217
max-andr/provably-robust-boosting
Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019]
Language: Python 49 6 211
max-andr/adversarial-random-search-gpt4
Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]
Language: Jupyter Notebook 39 3 01
max-andr/provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
Language: Jupyter Notebook 31 6 26
max-andr/cross-lipschitz
Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation [NeurIPS 2017]
Language: Python 18 5 03
max-andr/Papers-of-Robust-ML
Related papers for robust machine learning
5 2 0
max-andr/awesome-anomaly-detection
A curated list of awesome anomaly detection resources
1 2 0
max-andr/awesome-decision-tree-papers
A collection of research papers on decision, classification and regression trees with implementations.
1 2 0
max-andr/awesome-gradient-boosting-papers
A curated list of gradient boosting research papers with implementations.
1 2 0
max-andr/MIPVerify_data
Data for MIPVerify package.
Language: Python 1 2 0
max-andr/Provable-Training-and-Verification-Approaches-Towards-Robust-Neural-Networks
This repo keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular datasets and paper categorization.
1 2 0
max-andr/robustbench
RobustBench: a standardized adversarial robustness benchmark [arXiv, Oct 2020]
Language: Python 1 1 0
max-andr/robustml
Interfaces for defining Robust ML models and precisely specifying the threat models under which they claim to be secure.
Language: Python 1 3 0
max-andr/max-andr.github.io
Personal website
Language: JavaScript 1 0
max-andr/SwissUA
1 0
