Pinned Repositories
jailbreakbench
An Open Robustness Benchmark for Jailbreaking Language Models [arXiv 2024]
adversarial-random-search-gpt4
Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]
joint-cnn-mrf
Implementation of "Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation"
provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
provably-robust-boosting
Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019]
relu_networks_overconfident
Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem [CVPR 2019, oral]
square-attack
Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]
robustbench
RobustBench: a standardized adversarial robustness benchmark [NeurIPS'21 Benchmarks and Datasets Track]
llm-adaptive-attacks
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]
sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
max-andr's Repositories
max-andr/relu_networks_overconfident
Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem [CVPR 2019, oral]
max-andr/square-attack
Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]
max-andr/joint-cnn-mrf
Implementation of "Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation"
max-andr/provably-robust-boosting
Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019]
max-andr/adversarial-random-search-gpt4
Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]
max-andr/provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
max-andr/cross-lipschitz
Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation [NeurIPS 2017]
max-andr/Papers-of-Robust-ML
Related papers for robust machine learning
max-andr/awesome-anomaly-detection
A curated list of awesome anomaly detection resources
max-andr/awesome-decision-tree-papers
A collection of research papers on decision, classification and regression trees with implementations.
max-andr/awesome-gradient-boosting-papers
A curated list of gradient boosting research papers with implementations.
max-andr/MIPVerify_data
Data for MIPVerify package.
max-andr/Provable-Training-and-Verification-Approaches-Towards-Robust-Neural-Networks
This repo keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular datasets and paper categorization.
max-andr/robustbench
RobustBench: a standardized adversarial robustness benchmark [arXiv, Oct 2020]
max-andr/robustml
Interfaces for defining Robust ML models and precisely specifying the threat models under which they claim to be secure.
max-andr/max-andr.github.io
Personal website
max-andr/SwissUA