teal0range's Stars
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM-based Synthetic Data Generation 🔥
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
NIL-zhuang/EfficientRAG-official
Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
nju-lug/NJUThesis
Nanjing University thesis template
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
zou-group/textgrad
TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients.
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
AlignGPT-VL/AlignGPT
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
meta-llama/llama3
The official Meta Llama 3 GitHub site
gouqi666/DPO-deepspeed
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
MetaGLM/FinGLM
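FinGLM: a project dedicated to building an open, public-interest, long-lived financial large language model, using open source and open collaboration to advance "AI + finance".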
jxzhangjhu/Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
huggingface/trl
Train transformer language models with reinforcement learning.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
FudanDISC/DISC-FinLLM
DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive consulting services in financial scenarios.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
OpenLMLab/MOSS-RLHF
MOSS-RLHF
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
MLNLP-World/Paper-Writing-Tips
Paper Writing Tips: a repository compiled by the MLNLP community to help authors avoid small mistakes when submitting papers.
onimp/oni_multiplayer
Oxygen Not Included multiplayer mod. Work in progress.
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Efficient-ML/Awesome-Model-Quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and the project is continuously being improved. PRs adding works (papers, repositories) that the repo has missed are welcome.
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
thunlp/OpenDelta
A plug-and-play library for parameter-efficient tuning (Delta Tuning)