clearloveclearlove

Pinned Repositories

Alisa
ALiSa: Acrostic Linguistic Steganography Based on BERT and Gibbs Sampling
Language:Python8 2 02
BadActs
Language:Python4 1 00
BEAT
Language:Jupyter Notebook40
Revisiting-NLP-Backdoor
0 1 00
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python42.8k 244 6.1k5.2k
I-GCG
Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)
Language:Python81 2 45
AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
Language:Python55 2 25
learning_research
本人的科研经验
6.4k 73 31382
BEEAR
This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".
Language:HTML14 1 11
detection_logits
Language:Python1 2 11

clearloveclearlove's Repositories

clearloveclearlove/Alisa
ALiSa: Acrostic Linguistic Steganography Based on BERT and Gibbs Sampling
Language:Python8 2 02
clearloveclearlove/BadActs
Language:Python4 1 00
clearloveclearlove/BEAT
Language:Jupyter Notebook40
clearloveclearlove/Revisiting-NLP-Backdoor
0 1 00