chrisliu298
PhD student @UCSC CSE | Research Intern at @SkyworkAI
University of California, Santa Cruz · Santa Cruz, California
Pinned Repositories
awesome-llm-unlearning
A resource repository for machine unlearning in large language models
awesome-representation-engineering
A resource repository for representation engineering in large language models
awesome-sparse-autoencoders
A resource repository of sparse autoencoders for large language models
gpt2-arxiv
Fine-tuning GPT-2 to generate research paper abstracts
llm-unlearn-eco
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts
min_double_descent
A minimal example of double descent
resnet-tinyimagenet
Training ResNet models on the Tiny ImageNet dataset
roberta-imdb
IMDb sentiment analysis with RoBERTa
Skywork-Reward
Rank-1 and rank-3 reward models on RewardBench
tapt
Data augmentation by generating new samples
chrisliu298's Repositories
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
chrisliu298/awesome-representation-engineering
A resource repository for representation engineering in large language models
chrisliu298/llm-unlearn-eco
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts
chrisliu298/tapt
Data augmentation by generating new samples
chrisliu298/awesome-sparse-autoencoders
A resource repository of sparse autoencoders for large language models
chrisliu298/minimal-lm-finetune
A minimal example of fine-tuning autoregressive language models with multiple GPUs and DeepSpeed
chrisliu298/nanoGCG
A fast + lightweight implementation of the GCG algorithm in PyTorch
chrisliu298/min_double_descent
A minimal example of double descent
chrisliu298/Skywork-Reward
Rank-1 and rank-3 reward models on RewardBench
chrisliu298/alignment-handbook
Robust recipes to align language models with human and AI preferences
chrisliu298/Awesome-GenAI-Unlearning
chrisliu298/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
chrisliu298/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
chrisliu298/firefoxCSS
One-line, minimal, keyboard-centered Firefox CSS theme.
chrisliu298/halu_clf
chrisliu298/hugo-website
Minimalist Hugo template for academic websites
chrisliu298/kickstart.nvim
A launch point for your personal nvim configuration
chrisliu298/muse_bench
chrisliu298/Online-RLHF
A recipe for online RLHF.
chrisliu298/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
chrisliu298/Qwen2.5-Math
A series of math-specific large language models built on Qwen2.
chrisliu298/reward-bench
RewardBench: the first evaluation tool for reward models.
chrisliu298/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
chrisliu298/rm-score
chrisliu298/SOUL
Official repo for paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"
chrisliu298/SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
chrisliu298/tofu
Landing Page for TOFU
chrisliu298/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
chrisliu298/trl
Train transformer language models with reinforcement learning.
chrisliu298/wmdp
WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method that reduces LLM performance on WMDP while retaining general capabilities.