qcwthu

PhD Student at Nanyang Technological University, Singapore

qcwthu's Stars

QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python15.3k 112 1.1k1.2k
state-spaces/mamba
Mamba SSM architecture
Language:Python13.8k 101 5831.2k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python11.1k 70 108697
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python11.1k 166 8172.5k
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language:Python8.6k 147 3.8k1.5k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.8k 108 290491
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.6k 59 340468
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language:Python5.3k 39 42517
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python5.1k 54 335473
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python5k 51 215520
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.9k 111 137422
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
Language:Jupyter Notebook4.5k 26 44159
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.7k 30 389353
openai/weak-to-strong
Language:Python2.5k 33 17310
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Language:Python2.5k 128 39355
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Language:Python2.3k 40 30117
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python1.9k 25 184345
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Language:Jupyter Notebook1.6k 17 26180
salesforce/CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
Language:Python1.5k 21 34101
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数，训练数据，评估数据，评估方法。
Language:Python1.2k 24 63110
hendrycks/math
The MATH Dataset (NeurIPS 2021)
Language:Python974 12 2090
google-deepmind/funsearch
Language:Jupyter Notebook752 20 6135
allenai/lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
Language:Python457 10 730
GAIR-NLP/MathPile
[NeurlPS D&B 2024] Generative AI for Math: MathPile
Language:Python400 7 521
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
Language:JavaScript262 14 2119
sunlab-osu/Understanding-CoT
Language:Jupyter Notebook85 1 17
ntunlp/OpenSource-LLMs-better-than-OpenAI
Listing all reported open-source LLMs achieving a higher score than proprietary, paying OpenAI models (ChatGPT, GPT-4).
69 5 06
MaHuanAAA/g_fair_prompting
Language:Python31 1 02
Guy1m0/ZKML-Benchmark
Language:Jupyter Notebook25 1 01
srhthu/LM-CompEval-Legal
Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"
Language:Python11 1 21

qcwthu

qcwthu's Stars

QwenLM/Qwen

state-spaces/mamba

microsoft/LoRA

NVIDIA/Megatron-LM

triton-inference-server/server

01-ai/Yi

InternLM/InternLM

google/gemma_pytorch

arcee-ai/mergekit

allenai/OLMo

huggingface/alignment-handbook

1rgs/jsonformer

OpenRLHF/OpenRLHF

openai/weak-to-strong

openai/human-eval

thunlp/UltraChat

microsoft/Megatron-DeepSpeed

gkamradt/LLMTest_NeedleInAHaystack

salesforce/CodeTF

SkyworkAI/Skywork

hendrycks/math

google-deepmind/funsearch

allenai/lumos

GAIR-NLP/MathPile

allenai/fm-cheatsheet

sunlab-osu/Understanding-CoT

ntunlp/OpenSource-LLMs-better-than-OpenAI

MaHuanAAA/g_fair_prompting

Guy1m0/ZKML-Benchmark

srhthu/LM-CompEval-Legal