boyiwei

PhD student @ Princeton ECE.

Princeton UniversityPrinceton, NJ

Pinned Repositories

Evaluating-Durable-Safeguards
[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
Language:Python12 1 01
alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Language:Python73 1 119
boyiwei.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript00
COS598D-Pruning
Assignments for COS598D: System and Machine Learning
Language:Jupyter Notebook00
cos598d_sp24
Language:Python00
CoTaEval
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
Language:Python17 1 35
ReG-NAS
Language:Python2 1 00
RepNoise-Reproduce
Language:Jupyter Notebook1 1 00
tamper-resistance
Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"
Language:Python00
TAR-Reproduce
Language:Python1 1 00

boyiwei's Repositories

boyiwei/alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Language:Python73 1 119
boyiwei/CoTaEval
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
Language:Python17 1 35
boyiwei/ReG-NAS
Language:Python2 1 00
boyiwei/RepNoise-Reproduce
Language:Jupyter Notebook1 1 00
boyiwei/TAR-Reproduce
Language:Python1 1 00
boyiwei/boyiwei.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript00
boyiwei/COS598D-Pruning
Assignments for COS598D: System and Machine Learning
Language:Jupyter Notebook00
boyiwei/cos598d_sp24
Language:Python00
boyiwei/tamper-resistance
Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"
Language:Python00