tzt101's Stars
pprp/Awesome-LLM-Prune
Awesome list for LLM pruning.
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
EleutherAI/the-pile
thoppe/The-Pile-PubMed
Download, parse, and filter PubMed data, ready for The Pile.
EleutherAI/pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language-modelling-friendly format.
NVIDIA/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
LAION-AI/Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
microsoft/Phi-3CookBook
A cookbook for getting started with Phi-3, a family of open-source AI models developed by Microsoft. Phi-3 models are highly capable and cost-effective small language models (SLMs), outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
nikhil-ghosh-berkeley/loraplus
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
tianyi-lab/Mosaic-IT
Mosaic IT: Enhancing Instruction Tuning with Data Mosaics
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, and a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
pytorch/torchtune
PyTorch native finetuning library
kongds/MoRA
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
ZJLab-DataHub-Security/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
ymcui/Chinese-LLaMA-Alpaca-3
Chinese Llama-3 LLMs (phase 3 of the Chinese LLaMA-Alpaca project), developed from Meta Llama 3
bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
apple/corenet
CoreNet: A library for training deep neural networks
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational LLM)
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
ArtificialZeng/Qwen-Tuning
Qwen-Efficient-Tuning
shikiw/OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation