kevinliang888's Stars
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
simplescaling/s1
s1: Simple test-time scaling
jonathan-roberts1/zerobench
Code, Data and Red Teaming for ZeroBench
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
deepseek-ai/DeepSeek-R1
JailbreakBench/jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
WindyLee0822/Process_Q_Model
Official implementation of the paper "Process Reward Model with Q-value Rankings"
sdiehl/prm
Library for training process reward models
RAGEN-AI/RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
meg-tong/sycophancy-eval
Datasets from the paper "Towards Understanding Sycophancy in Language Models"
sylinrl/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
centerforaisafety/hle
Humanity's Last Exam
youngyangyang04/leetcode-master
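《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look, and you'll wish you had found it sooner! 🚀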
doocs/leetcode
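🔥 LeetCode solutions in any programming language | Solutions in multiple programming languages for LeetCode, 《剑指 Offer(第 2 版)》 (Coding Interviews, 2nd Edition), and 《程序员面试金典(第 6 版)》 (Cracking the Coding Interview, 6th Edition)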
princetonvisualai/icons
520xyxyzq/awesome-object-SLAM
A curated list of Object SLAM papers and resources
mengdi-li/awesome-RLAIF
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
SafeRoboticsLab/Who_Plays_First
Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots" - RSS 2024
SafeRoboticsLab/Deception_Game
Synthesizing safe robot policies in joint physical-belief spaces with deep RL! - CoRL 2023
kevinliang888/IVR-QA-baselines
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
jxzhangjhu/Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
kevinliang888/IntroPlan
[NeurIPS 2024] Introspective Planning: Aligning Robots’ Uncertainty with Inherent Task Ambiguity
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
patrickrchao/JailbreakingLLMs
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
CambioML/pykoi-rlhf-finetuned-transformers
pykoi: Active learning in one unified interface