TamSiuhin

Ph.D. student at University of Notre Dame

XJTU, Notre Dame

TamSiuhin's Stars

meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 27.2k 226 2633.1k
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
Language: Python 18.9k 140 8111.4k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.7k 111 136411
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python 2.2k 19 82180
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
1.9k 49 10101
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook 1.5k 8 147245
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language: Python 1k 12 3192
caserec/Datasets-for-Recommender-Systems
This is a repository of topic-centric, high-quality public data sources for Recommender Systems (RS)
Language: Jupyter Notebook 989 27 4167
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Language: Python 899 4 2737
RUCAIBox/RecSysDatasets
This is a repository of public data sources for Recommender Systems (RS).
Language: Python 858 14 38132
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
Language: Python 797 3 3249
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python 713 8 7150
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language: Python 433 18 2348
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Language: Jupyter Notebook 426 5 2757
kaistAI/FLASK
[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
Language: Python 211 2 418
XiangLi1999/ContrastiveDecoding
Contrastive decoding
Language: Python 181 3 1012
mutonix/RefGPT
Language: Python 93 2 36
r-three/phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
Language: Python 78 1 54
EleutherAI/stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
Language: Python 76 3 114
luchris429/DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
Language: Python 51 4 129
HannahKirk/prism-alignment
The Prism Alignment Project
Language: Jupyter Notebook 37 2 11
xhan77/context-aware-decoding
Language: Python 27 1 24
whr000001/DELL
Code for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Language: Python 22 1 20
BunsenFeng/AbstainQA
AbstainQA, ACL 2024
Language: Python 19 2 10
BunsenFeng/botsay
What does the bot say? ACL 2024
Language: Python 14 1 02
liugangcode/InfoAlign
The code for "Learning Molecular Representation in a Cell"
Language: Python 12 2 01
Edward-Sun/PIT
Pre-instruction-tuning: https://arxiv.org/abs/2402.12847
7 1 1
MatthewYZhang/NLGift
Language: Python 7 1 00
QingkaiZeng/Chain-of-Layer
Code for Chain-of-Layer
Language: Python 5 1 12
fxsxjtu/RICH
This is the official repository of RICH.
Language: Python 2 1 00
