chujiezheng's Stars
xai-org/grok-1
Grok open release
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
arcee-ai/mergekit
Tools for merging pretrained large language models.
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
mosaicml/llm-foundry
LLM training code for Databricks foundation models
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Jonathan-LeRoux/IguanaTex
A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
caolvchong-top/twitter_download
推特 图片 视频 爬虫;一键下载
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
meta-math/MetaMath
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
OpenBMB/Eurus
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
JailbreakBench/jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
xingyaoww/mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
chujiezheng/LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
cascip/awesome-auto-alignment
Collection of papers for scalable automated alignment.
argilla-io/distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts