Pinned Repositories
academicpages.github.io
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
llm-quantization-attack
muse_bench
llm_attack_defense_arena
CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
DPGBA
An official implementation of "Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective" (KDD 2024)
FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
RIGBD
An official implementation of "Robustness Inspired Graph Backdoor Defense" (ICLR 2025 Oral)
zhiweizhang.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
zzwjames's Repositories
zzwjames/FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
zzwjames/DPGBA
An official implementation of "Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective" (KDD 2024)
zzwjames/RIGBD
An official implementation of "Robustness Inspired Graph Backdoor Defense" (ICLR 2025 Oral)
zzwjames/zhiweizhang.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes