teal0range

Nanjing University
Nanjing, Jiangsu Province, China

teal0range's Stars

wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
64940
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
4.8k · 261
NIL-zhuang/EfficientRAG-official
Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
Language: Python · 151
nju-lug/NJUThesis
Nanjing University thesis template
Language: TeX · 45562
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Language: Jupyter Notebook · 478
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
Language: Python · 9310
zou-group/textgrad
TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients.
Language: Python · 1.7k · 144
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook · 1.5k · 238
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python · 68243
AlignGPT-VL/AlignGPT
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
Language: Python · 293
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · 26.7k · 3k
gouqi666/DPO-deepspeed
Language: Python · 7
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language: Python · 7.2k · 523
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
92651
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
65344
MetaGLM/FinGLM
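FinGLM: a project dedicated to building an open, public-interest, long-lasting financial large model, using open source to advance "AI + finance".
Language: HTML · 1.7k · 262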
jxzhangjhu/Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
92360
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python · 9.8k · 1.2k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python · 4.5k · 404
FudanDISC/DISC-FinLLM
DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services.
Language: Python · 58669
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
Language: Python · 13.4k · 1.6k
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language: Python · 1.3k · 101
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python · 9k · 1.7k
MLNLP-World/Paper-Writing-Tips
Paper Writing Tips: a repository compiled by the MLNLP community to help authors avoid small mistakes in paper submissions.
3.5k · 459
onimp/oni_multiplayer
Oxygen Not Included multiplayer mod. Work in progress.
Language: C# · 22018
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
2.9k · 247
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python · 10.3k · 2.3k
Efficient-ML/Awesome-Model-Quantization
A list of papers, docs, and code on model quantization. This repo aims to provide information for model quantization research and is continuously being improved. PRs adding missing works (papers, repositories) are welcome.
1.8k · 205
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language: Python · 4.3k · 450
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language: Python · 99180
