chujiezheng

PhD candidate at @thu-coai | @QwenLM

Tsinghua UniversityBeijing, China

chujiezheng's Stars

xai-org/grok-1
Grok open release
Language:Python49.9k 602 2168.3k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python40.7k 232 5.9k5k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.5k 348 2.9k4.2k
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language:Python17.6k 94 4.3k2.2k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python9.1k 77 601648
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python8.2k 110 159503
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python5.6k 47 1.7k489
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python5.2k 56 341497
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
5.1k 64 59769
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
4.8k 40 99481
mosaicml/llm-foundry
LLM training code for Databricks foundation models
Language:Python4.1k 49 391544
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language:Python2.5k 42 24242
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter Notebook1.6k 8 160255
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python1.2k 19 3883
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Language:Python1.2k 28 197159
Jonathan-LeRoux/IguanaTex
A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac
Language:VBA957 15 7766
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
Language:Python794 8 4446
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Language:Python739 8 3992
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
Language:Cuda735 8 2829
caolvchong-top/twitter_download
推特图片视频爬虫;一键下载
Language:Python503 8 7758
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python497 17 935
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language:Python477 15 2847
meta-math/MetaMath
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Language:Python406 7 2839
OpenBMB/Eurus
Language:Python304 11 1114
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
Language:Python297 16 2723
JailbreakBench/jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
Language:Python295 4 1331
xingyaoww/mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
Language:Python112 4 47
chujiezheng/LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
Language:Python72 5 12
cascip/awesome-auto-alignment
Collection of papers for scalable automated alignment.
64 3 16
argilla-io/distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
Language:Python24 4 00

chujiezheng

chujiezheng's Stars

xai-org/grok-1

hiyouga/LLaMA-Factory

microsoft/DeepSpeed

BerriAI/litellm

facebookresearch/xformers

jzhang38/TinyLlama

InternLM/lmdeploy

arcee-ai/mergekit

deepseek-ai/DeepSeek-Coder-V2

deepseek-ai/DeepSeek-V2

mosaicml/llm-foundry

databricks/dbrx

tatsu-lab/alpaca_eval

RLHFlow/RLHF-Reward-Modeling

huggingface/lighteval

Jonathan-LeRoux/IguanaTex

yule-BUAA/MergeLM

lmarena/arena-hard-auto

efeslab/Nanoflow

caolvchong-top/twitter_download

maitrix-org/Pandora

RLHFlow/Online-RLHF

meta-math/MetaMath

OpenBMB/Eurus

alibaba/ChatLearn

JailbreakBench/jailbreakbench

xingyaoww/mint-bench

chujiezheng/LLM-Extrapolation

cascip/awesome-auto-alignment

argilla-io/distilabel-spin-dibt