Walter000

韬光养晦

上海-武汉-常德

Walter000's Stars

THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1k87
weijunext/indie-hacker-tools
收录独立开发者出海技术栈和工具
6.2k585
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.8k405
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.3k96
ShiArthur03/ShiArthur03
Language:MATLAB10.4k1.9k
beyondguo/LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Language:HTML96099
bilibili/Index-1.9B
A SOTA lightweight multilingual LLM
Language:Python85047
charent/ChatLM-mini-Chinese
中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。
Language:Python1.1k138
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python6.5k1.7k
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python7.7k447
ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Language:Go90k7.1k
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python6.8k993
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.5k388
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
54235
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook6.9k436
JetRunner/BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
Language:Python31038
CodedotAl/gpt-code-clippy
Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57
Language:Python3.3k220
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k577
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python5.7k512
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.7k619
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python140k26.5k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook46.8k5.5k
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
Language:Python2.5k302
X-D-Lab/LangChain-ChatGLM-Webui
基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答
Language:Python3.1k475
LlamaFamily/Llama-Chinese
Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用
Language:Python13.7k1.2k
LC1332/Chat-Haruhi-Suzumiya
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
Language:Jupyter Notebook1.8k156
githubvpn007/Clash-for-Mac
Clash for Windows for Mac，Clash for Windows for Mac教程，Clash for Windows for Mac配置说明，Clash for Mac
18630
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python13.4k1.1k
s0md3v/roop
one-click face swap
Language:Python28k6.8k
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
8.9k1.8k

Walter000

Walter000's Stars

THUDM/LongWriter

weijunext/indie-hacker-tools

open-compass/opencompass

OpenLMLab/MOSS-RLHF

ShiArthur03/ShiArthur03

beyondguo/LLM-Tuning

bilibili/Index-1.9B

charent/ChatLM-mini-Chinese

EleutherAI/lm-evaluation-harness

jzhang38/TinyLlama

ollama/ollama

EleutherAI/gpt-neox

huggingface/alignment-handbook

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

OpenBMB/MiniCPM

JetRunner/BERT-of-Theseus

CodedotAl/gpt-code-clippy

ymcui/Chinese-LLaMA-Alpaca-2

yangjianxin1/Firefly

salesforce/BLIP

AUTOMATIC1111/stable-diffusion-webui

facebookresearch/segment-anything

DLLXW/baby-llama2-chinese

X-D-Lab/LangChain-ChatGLM-Webui

LlamaFamily/Llama-Chinese

LC1332/Chat-Haruhi-Suzumiya

githubvpn007/Clash-for-Mac

QwenLM/Qwen

s0md3v/roop

xszyou/Fay