Hazelsuko07's Stars
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
openfl/openfl
An open source library for creative expression on the web, desktop, mobile and consoles. Inspired by the classic Flash and AIR APIs.
facebookresearch/CrypTen
A framework for Privacy Preserving Machine Learning
OpenMined/PyGrid-deprecated---see-PySyft-
A Peer-to-peer Platform for Secure, Privacy-preserving, Decentralized Data Science
automl/NASLib
NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several state-of-the-art NAS search spaces and optimizers.
Eric-Wallace/universal-triggers
Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)
google-research/lm-extraction-benchmark
neulab/knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT
FedML-AI/FedNLP
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
OpenMined/PyVertical
Privacy Preserving Vertical Federated Learning
Princeton-SysML/GradAttack
GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning, as well as corresponding mitigation strategies.
Princeton-SysML/Jailbreak_LLM
da03/Internalize_CoT_Step_by_Step
yu4u/dnn-watermark
Implementation of "Embedding Watermarks into Deep Neural Networks," in Proc. of ICMR'17.
adiyoss/WatermarkNN
Watermarking Deep Neural Networks (USENIX 2018)
ammartahir24/SecureAggregation
An implementation of Secure Aggregation algorithm based on "Practical Secure Aggregation for Privacy-Preserving Machine Learning (Bonawitz et. al)" in Python.
boyiwei/alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Princeton-SysML/FILM
Official repo for the paper: Recovering Private Text in Federated Learning of Language Models (in NeurIPS 2022)
DingfanChen/GAN-Leaks
Official implementation of "GAN-Leaks: A Taxonomy of Membership Inference Attacks against Generative Models" (CCS 2020)
SORRY-Bench/sorry-bench
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
AlphaPav/mem-kk-logic
On Memorization of Large Language Models in Logical Reasoning
ml-postech/gradient-inversion-generative-image-prior
corentingiraud/federated-learning-secure-aggregation
A simple Python implementation of a secure aggregation protocole for federated learning.
swj0419/muse_bench
BatsResearch/cross-lingual-detox
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024
boyiwei/CoTaEval
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
ethz-spylab/unlearning-vs-safety
princeton-nlp/CopyCat
AI-Law-Society-Lab/Evaluating-Durable-Safeguards
daogaoliu/unlearning-under-adversary