Pinned Repositories
GODA
Harry-mic.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
la-mbda
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
RL-ViGen
This is the repo for RL-ViGen
TREvaL
Reasonable Reward Evaluation of Large Language Models
alignment-handbook
Robust recipes to align language models with human and AI preferences
Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
Value-Augmented-Sampling
RE-Control
Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
Harry-mic's Repositories
Harry-mic/TREvaL
Reasonable Reward Evaluation of Large Language Models
Harry-mic/GODA
Harry-mic/Harry-mic.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Harry-mic/la-mbda
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
Harry-mic/RL-ViGen
This is the repo for RL-ViGen