ml-safety

There are 19 repositories under ml-safety topic.

Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Language:Python4.2k 33 463279
hendrycks/robustness
Corruption and Perturbation Robustness (ICLR 2019)
Language:Python1k 14 55145
hendrycks/natural-adv-examples
A Harder ImageNet Test Set (CVPR 2021)
Language:Python597 12 1452
hendrycks/outlier-exposure
Deep Anomaly Detection with Outlier Exposure (ICLR 2019)
Language:Python550 19 24108
JohnSnowLabs/langtest
Deliver safe & effective language models
Language:Python505 10 45841
agencyenterprise/PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
Language:Python321 11 232
hendrycks/ss-ood
Self-Supervised Learning for OOD Detection (NeurIPS 2019)
Language:Python266 8 2531
hendrycks/ethics
Aligning AI With Shared Human Values (ICLR 2021)
Language:Python263 9 744
hendrycks/imagenet-r
ImageNet-R(endition) and DeepAugment (ICCV 2021)
Language:Python258 8 918
jiachens/ModelNet40-C
Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296
Language:Python217 14 1725
Giskard-AI/awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
166 3 014
hendrycks/anomaly-seg
The Combined Anomalous Object Segmentation (CAOS) Benchmark
Language:Python157 10 3020
hendrycks/pre-training
Pre-Training Buys Better Robustness and Uncertainty Estimates (ICML 2019)
Language:Python100 7 717
YyzHarry/ME-Net
[ICML 2019] ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation
Language:Python53 3 610
hendrycks/jiminy-cricket
Jiminy Cricket Environment (NeurIPS 2021)
Language:ZAP24 4 03
yaodongyu/ProjNorm
Predicting Out-of-Distribution Error with the Projection Norm
Language:Python17 2 31
moonwatcher-ai/moonwatcher
Evaluation & testing framework for computer vision models
Language:Python16 2 02
harsmac/MUFIACode
Code for the attack multiplicative filter attack MUFIA, from the paper "Frequency-based vulnerability analysis of deep learning models against image corruptions".
Language:Python3 1 00
ArianeDlns/adv-AI-project
This repository contains the project for the Advanced AI course @CentraleSupélec
Language:Jupyter Notebook2 2 00

ml-safety

Giskard-AI/giskard

hendrycks/robustness

hendrycks/natural-adv-examples

hendrycks/outlier-exposure

JohnSnowLabs/langtest

agencyenterprise/PromptInject

hendrycks/ss-ood

hendrycks/ethics

hendrycks/imagenet-r

jiachens/ModelNet40-C

Giskard-AI/awesome-ai-safety

hendrycks/anomaly-seg

hendrycks/pre-training

YyzHarry/ME-Net

hendrycks/jiminy-cricket

yaodongyu/ProjNorm

moonwatcher-ai/moonwatcher

harsmac/MUFIACode

ArianeDlns/adv-AI-project