trustworthy-ai
There are 91 repositories under the trustworthy-ai topic.
Trusted-AI/adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
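ART's evasion module implements gradient-based attacks such as the fast gradient sign method (FGSM). As a rough, dependency-free sketch of the FGSM idea only (this is not ART's API; all names and values below are illustrative), here is one attack step against a toy logistic model:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def logloss(x, w, y):
    """Logistic loss log(1 + exp(-y * w.x)) for label y in {-1, +1}."""
    return math.log(1.0 + math.exp(-y * sum(wi * xi for wi, xi in zip(w, x))))

def fgsm_step(x, w, y, eps):
    """One FGSM step: move x by eps in the sign of the loss gradient,
    which increases the log-loss for the true label y."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    # d/dx log(1 + exp(-y * w.x)) = -y * sigmoid(-y * w.x) * w
    coeff = -y * sigmoid(-margin)
    grad = [coeff * wi for wi in w]
    sign = lambda g: 1 if g > 0 else -1 if g < 0 else 0
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]

w = [1.5, -2.0]   # toy model weights (illustrative)
x = [0.8, -0.5]   # clean input, true label +1
x_adv = fgsm_step(x, w, +1, eps=0.3)
```

The perturbed input `x_adv` has a strictly higher loss than `x` under the same model, which is the evasion effect ART's attacks produce at scale.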
Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for LLMs and ML models
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
THUYimingLi/BackdoorBox
An open-source Python toolbox for backdoor attacks and defenses.
HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
yunqing-me/AttackVLM
[NeurIPS 2023] On Evaluating Adversarial Robustness of Large Vision-Language Models
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch
yunqing-me/WatermarkDM
Code of the paper: A Recipe for Watermarking Diffusion Models
verivital/nnv
Neural Network Verification Software Tool
ffhibnese/Model-Inversion-Attack-ToolBox
A comprehensive toolbox for model inversion attacks and defenses that is easy to get started with.
ml-for-high-risk-apps-book/Machine-Learning-for-High-Risk-Applications-Book
Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications
aiverify-foundation/aiverify
AI Verify
IBM/ai-privacy-toolkit
A collection of tools and techniques for the privacy and compliance of AI models.
dlmacedo/entropic-out-of-distribution-detection
A project that adds scalable, state-of-the-art out-of-distribution detection (open-set recognition) support by changing two lines of code. Performs efficient inference (no increase in inference time) and detection without a drop in classification accuracy, hyperparameter tuning, or collecting additional data.
qitianwu/GraphOOD-GNNSafe
The official implementation for ICLR23 paper "GNNSafe: Energy-based Out-of-Distribution Detection for Graph Neural Networks"
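The energy score at the heart of energy-based OOD detection is the negative log-sum-exp of a classifier's logits. A minimal sketch of that score alone (the example logits below are illustrative, not taken from the paper or its graph setting):

```python
import math

def energy_score(logits):
    """Energy score E(x) = -log sum_k exp(logit_k).

    Lower (more negative) energy indicates a confident, in-distribution
    prediction; higher energy suggests an out-of-distribution input.
    """
    m = max(logits)  # subtract the max to stabilize log-sum-exp
    return -(m + math.log(sum(math.exp(l - m) for l in logits)))

in_dist = [9.0, 0.5, 0.2]  # one dominant logit: confident prediction
ood = [0.4, 0.5, 0.3]      # flat logits: no confident class
```

Thresholding this score is the basic detection rule; GNNSafe's contribution is propagating such energies over the graph structure.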
ai4ce/FLAT
[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory
JerryX1110/Robust-Video-Object-Segmentation
[ACM MM 2022] Towards Robust Video Object Segmentation with Adaptive Object Calibration
szandala/TorchPRISM
Principal Image Sections Mapping. Convolutional Neural Network Visualisation and Explanation Framework
dlmacedo/distinction-maximization-loss
A project to improve out-of-distribution detection (open-set recognition) and uncertainty estimation by changing a few lines of code in your project. Performs efficient inference (no increase in inference time) without repeated model training, hyperparameter tuning, or collecting additional data.
95616ARG/SyReNN
SyReNN: Symbolic Representations for Neural Networks
AthenaCore/AwesomeResponsibleAI
A curated list of awesome academic research, books, code of ethics, data sets, institutes, newsletters, principles, podcasts, reports, tools, regulations and standards related to Responsible AI, Trustworthy AI, and Human-Centered AI.
zhihengli-UR/StyleT2I
Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)
sleeepeer/PoisonedRAG
Code and data for the PoisonedRAG paper.
moonshot-admin/moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
zRapha/FAME
Framework for Adversarial Malware Evaluation.
sail-sg/finetune-fair-diffusion
Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness
ffhibnese/GIFD_Gradient_Inversion_Attack
[ICCV 2023] A gradient inversion attack on federated learning based on generative adversarial networks.
TMIS-Turbo/FNI-RL
[TPAMI, 2023] Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving
Crisp-Unimib/ContrXT
A tool for comparing the predictions of any text classifiers.
zhihengli-UR/DebiAN
Official code of "Discover and Mitigate Unknown Biases with Debiasing Alternate Networks" (ECCV 2022)
yuji-roh/fairbatch
FairBatch: Batch Selection for Model Fairness (ICLR 2021)
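FairBatch adapts per-group sampling probabilities between epochs so that underperforming groups are seen more often. A loose sketch of that idea (the update rule, names, and step size below are simplified illustrations, not the paper's exact algorithm):

```python
import random

def adjust_rates(rates, group_losses, alpha=0.1):
    """FairBatch-style update (sketch): shift sampling mass toward the
    group with the highest average loss, then renormalize to sum to 1."""
    worst = max(group_losses, key=group_losses.get)
    new = {g: r + (alpha if g == worst else 0.0) for g, r in rates.items()}
    total = sum(new.values())
    return {g: r / total for g, r in new.items()}

def sample_batch(data_by_group, rates, batch_size, rng=random):
    """Draw a batch whose group composition follows the current rates."""
    batch = []
    for g, r in rates.items():
        k = max(1, round(r * batch_size))
        batch += rng.choices(data_by_group[g], k=k)
    return batch

# Group "a" has higher loss, so its sampling rate should increase.
rates = adjust_rates({"a": 0.5, "b": 0.5}, {"a": 0.9, "b": 0.3})
```

The actual method derives its update signal from a chosen fairness criterion (e.g. equal opportunity) rather than raw loss, but the mechanism of reweighting batch composition is the same.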
zhihengli-UR/discover_unknown_biases
Official code of "Discover the Unknown Biased Attribute of an Image Classifier" (ICCV 2021)
LucasFidon/trustworthy-ai-fetal-brain-segmentation
Trustworthy AI method based on Dempster-Shafer theory - application to fetal brain 3D T2w MRI segmentation
Crisp-Unimib/MERLIN
MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how the behaviour of two machine learning models differs.
dlmacedo/robust-deep-learning
A project to train a model from scratch, or fine-tune a pretrained one, using the losses provided in this library to improve out-of-distribution detection and uncertainty estimation. Calibrate your model to produce better uncertainty estimates, and detect out-of-distribution data using the defined score type and threshold.
seedatnabeel/Data-IQ
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)