qcwthu's Stars
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
state-spaces/mamba
Mamba SSM architecture
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
arcee-ai/mergekit
Tools for merging pretrained large language models.
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
openai/weak-to-strong
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
salesforce/CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.
hendrycks/math
The MATH Dataset (NeurIPS 2021)
google-deepmind/funsearch
allenai/lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
GAIR-NLP/MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
sunlab-osu/Understanding-CoT
ntunlp/OpenSource-LLMs-better-than-OpenAI
Listing all reported open-source LLMs achieving a higher score than proprietary, paid OpenAI models (ChatGPT, GPT-4).
MaHuanAAA/g_fair_prompting
Guy1m0/ZKML-Benchmark
srhthu/LM-CompEval-Legal
Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"