Pinned Repositories
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
RecInterpreter
Recommender_System
Item-based collaborative filtering (CF) recommender system
RecommenderSystem
SASRec.pytorch
PyTorch (1.6+) implementation of https://github.com/kang205/SASRec
SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
AgentSims
AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.
ustc-cs-graduate
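University of Science and Technology of China (USTC), Computer Science: detailed guide to the graduate admission second-round examination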
zdszero.github.io
vim markdown wiki notebook
schrieffer-z's Repositories
schrieffer-z/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
schrieffer-z/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
schrieffer-z/RecommenderSystem
schrieffer-z/SASRec.pytorch
PyTorch (1.6+) implementation of https://github.com/kang205/SASRec
schrieffer-z/RecInterpreter
schrieffer-z/Recommender_System
Item-based collaborative filtering (CF) recommender system