TamSiuhin's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
caserec/Datasets-for-Recommender-Systems
This is a repository of topic-centric, high-quality public data sources for Recommender Systems (RS)
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
RUCAIBox/RecSysDatasets
This is a repository of public data sources for Recommender Systems (RS).
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
kaistAI/FLASK
[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
XiangLi1999/ContrastiveDecoding
contrastive decoding
mutonix/RefGPT
r-three/phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
EleutherAI/stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
luchris429/DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
HannahKirk/prism-alignment
The Prism Alignment Project
xhan77/context-aware-decoding
whr000001/DELL
This is code for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
BunsenFeng/AbstainQA
AbstainQA, ACL 2024
BunsenFeng/botsay
What does the bot say? ACL 2024
liugangcode/InfoAlign
The code for "Learning Molecular Representation in a Cell"
Edward-Sun/PIT
pre-instruction-tuning: https://arxiv.org/abs/2402.12847
MatthewYZhang/NLGift
QingkaiZeng/Chain-of-Layer
Code for Chain-of-Layer
fxsxjtu/RICH
This is the official repository of RICH.