kevinliang888

Princeton CS PhD

Princeton

kevinliang888's Stars

RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
Language:Python45229
simplescaling/s1
s1: Simple test-time scaling
Language:Python6.1k710
jonathan-roberts1/zerobench
Code, Data and Red Teaming for ZeroBench
432
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Language:Python23.3k2.1k
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Language:Python11.4k1.4k
deepseek-ai/DeepSeek-R1
87.5k11.3k
JailbreakBench/jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
Language:Python31934
WindyLee0822/Process_Q_Model
official implementation of paper "Process Reward Model with Q-value Rankings"
Language:Python514
sdiehl/prm
Library for training process reward models
Language:Python212
RAGEN-AI/RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Language:Python1.2k87
meg-tong/sycophancy-eval
datasets from the paper "Towards Understanding Sycophancy in Language Models"
Language:Jupyter Notebook737
sylinrl/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
Language:Jupyter Notebook69880
centerforaisafety/hle
Humanity's Last Exam
Language:Python58127
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀
Language:Shell55.1k11.9k
doocs/leetcode
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer（第 2 版）》、《程序员面试金典（第 6 版）》题解
Language:Java33.4k8.8k
princetonvisualai/icons
Language:Python11
520xyxyzq/awesome-object-SLAM
A curated list of Object SLAM papers and resources
27121
mengdi-li/awesome-RLAIF
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
1584
zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Language:Python905
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
Language:Jinja63558
SafeRoboticsLab/Who_Plays_First
Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots" - RSS 2024
Language:Python151
SafeRoboticsLab/Deception_Game
Synthesizing safe robot policies in joint physical-belief spaces with deep RL! - CoRL 2023
Language:Python4
kevinliang888/IVR-QA-baselines
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
Language:Python141
jxzhangjhu/Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
1.1k75
kevinliang888/IntroPlan
[NeurIPS 2024] Introspective Planning: Aligning Robots’ Uncertainty with Inherent Task Ambiguity
Language:Jupyter Notebook21
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python45.4k5.5k
patrickrchao/JailbreakingLLMs
Language:Python52579
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
Language:Python3.8k514
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.6k477
CambioML/pykoi-rlhf-finetuned-transformers
pykoi: Active learning in one unified interface
Language:Jupyter Notebook41044

kevinliang888

kevinliang888's Stars

RUCAIBox/HaluEval

simplescaling/s1

jonathan-roberts1/zerobench

huggingface/open-r1

Jiayi-Pan/TinyZero

deepseek-ai/DeepSeek-R1

JailbreakBench/jailbreakbench

WindyLee0822/Process_Q_Model

sdiehl/prm

RAGEN-AI/RAGEN

meg-tong/sycophancy-eval

sylinrl/TruthfulQA

centerforaisafety/hle

youngyangyang04/leetcode-master

doocs/leetcode

princetonvisualai/icons

520xyxyzq/awesome-object-SLAM

mengdi-li/awesome-RLAIF

zhiyuanhubj/UoT

chujiezheng/chat_templates

SafeRoboticsLab/Who_Plays_First

SafeRoboticsLab/Deception_Game

kevinliang888/IVR-QA-baselines

jxzhangjhu/Awesome-LLM-RAG

kevinliang888/IntroPlan

hiyouga/LLaMA-Factory

patrickrchao/JailbreakingLLMs

llm-attacks/llm-attacks

CarperAI/trlx

CambioML/pykoi-rlhf-finetuned-transformers