Pinned Repositories
AgentPoison
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
CAV-intelligence
This repository contains the algorithms implementation for vehicles scheduling, dispatching and planning in complicated scenarios such as intersection, junction etc. Currently we are developing learnable driving policies module via inverse reinforcement learning algorithms.
HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
MJ-Bench
Official implementation for "MJ-BENCH: Is Your Multimodal Reward Model Really a Good Judge?"
PANDORA
POAR-SRL-4-Robot
The implementation for the integrated SRL-RL algorithm POAR as well as the simulator for environments
pottery-fragments-matching
applying both traditional and heuristics methods to pottery relics restoration (fragments matching, 3D model alignment)
RL_Plane_Strategy
established for the data normalization and reinforcement learning training scheme to train an agent in DCS world
SafeWatch
Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations"
MJ-Bench
Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
BillChan226's Repositories
BillChan226/HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
BillChan226/RL_Plane_Strategy
established for the data normalization and reinforcement learning training scheme to train an agent in DCS world
BillChan226/MJ-Bench
Official implementation for "MJ-BENCH: Is Your Multimodal Reward Model Really a Good Judge?"
BillChan226/POAR-SRL-4-Robot
The implementation for the integrated SRL-RL algorithm POAR as well as the simulator for environments
BillChan226/SafeWatch
Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations"
BillChan226/CAV-intelligence
This repository contains the algorithms implementation for vehicles scheduling, dispatching and planning in complicated scenarios such as intersection, junction etc. Currently we are developing learnable driving policies module via inverse reinforcement learning algorithms.
BillChan226/PANDORA
BillChan226/pottery-fragments-matching
applying both traditional and heuristics methods to pottery relics restoration (fragments matching, 3D model alignment)
BillChan226/robotXhuman
Applying IRL algorithms to learn robot arm motion planning from human demonstration
BillChan226/SafeRLZoo
SafeRLZoo, a standardized toolkit with over 12 SOTA model-free safe RL algorithms based on Spinningup and benchmark safety-critical tasks
BillChan226/BillChan226
Config files for my GitHub profile.
BillChan226/billchan226.github.io
BillChan226/Notebook
BillChan226/SS-RLHF
Train transformer language models with reinforcement learning.
BillChan226/Visualization-Project-for-Greyout
This project mainly develops a visualization platform aiming to process and present the medical data of Greyout/Redout to analyze this symptom and its causes both qualitatively and quantitatively.
BillChan226/Anti-cyberattack-Trajectory-Planning-for-CAVs
BillChan226/CS520-Fall23
This repository is for backups for Professor David Gleich's course CS520 in Fall 2023 at Purdue.
BillChan226/DoggoRobot
Purdue Spring23 CS593 Project
BillChan226/LfD-for-on-policy-AC
a learn-from-demonstration framework for on-policy actor-critic reinforcement learning algorithms
BillChan226/MPC-text-RL4LMs
BillChan226/ODA-Multi-Manipulator
BillChan226/video_guard