Pinned Repositories
CRT
[NAACL 2022] Confidentially Redacted Training (CRT)
DRW
[EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP
DVGCN
[ISBI 2019] Deep Voxel-Graph Convolution Network
Ginsew
[ICML 2023] Protecting Language Generation Models via Invisible Watermarking
HPD
[ACL 2022] Homomorphic projective distillation (HPD) for sentence embedding
NPPrompt
[ACL 2023] NPPrompt: Pre-trained Language Models Can be Fully Zero-Shot Learners
pf-decoding
Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs
Unigram-Watermark
[ICLR 2024] Provable Robust Watermarking for AI-Generated Text
WatermarkAttacker
Invisible Image Watermarks Are Provably Removable Using Generative AI
weak-to-strong
Weak-to-Strong Jailbreaking on Large Language Models
XuandongZhao's Repositories
XuandongZhao doesn’t have any repository yet.