ChasonShi

nothing but cool

Shandong University

ChasonShi's Stars

LC044/WeChatMsg
提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手
Language:Python35k 176 4173.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python31.2k 253 5.4k4.7k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.3k 228 2653.1k
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language:Python8k 44 686887
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Language:SystemVerilog7.1k 67 24539
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识，包括面试题，各种基础知识，工程能力等等，提升核心竞争力
Language:Python6.9k 105 51.2k
kyutai-labs/moshi
Language:Python6.9k 77 85539
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python4.9k 52 321447
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
3.3k 41 4298
pytorch/torchtitan
A native PyTorch Library for large model training
Language:Python2.7k 43 185215
PeterH0323/Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋
Language:Python2.6k 40 28402
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
2.1k 32 1181
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Language:Jupyter Notebook1.6k 17 26171
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.4k 14 871
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.4k 20 6856
RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Language:JavaScript1.2k 6 1560
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Language:Python887 8 2246
THUDM/LongBench
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Language:Python680 7 7556
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
590 17 229
cognitivecomputations/laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
Language:Python232 10 827
morecry/CharacterEval
Language:Python184 2 2910
nrimsky/CAA
Steering Llama 2 with Contrastive Activation Addition
Language:Jupyter Notebook101 1 631
VITA-Group/LiGO
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox, Zhangyang Wang, Yoon Kim
Language:Python82 26 29
nick7nlp/Counting-Stars
Counting-Stars (★)
Language:Jupyter Notebook77 3 32
Furyton/awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.
Language:Python59 4 01
nancheng58/RecMamba
Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation
Language:Python25 1 22
OpenDFM/ibsen
[ACL 2024] Official code for "IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation" (TheatreMaker)
Language:Python20 2 40
LRudL/sad
Situational Awareness Dataset
Language:HTML192
CURRENTF/MEFT
Language:Python18 1 21
dourgey/qwen2_moe_mergekit
根据Qwen2（Qwen1.5）模型生成qwen2 MoE模型的工具
Language:Python10 1 12

ChasonShi

ChasonShi's Stars

LC044/WeChatMsg

vllm-project/vllm

meta-llama/llama3

axolotl-ai-cloud/axolotl

adam-maj/tiny-gpu

DA-southampton/NLP_ability

kyutai-labs/moshi

arcee-ai/mergekit

ahmetbersoz/chatgpt-prompts-for-academic-writing

pytorch/torchtitan

PeterH0323/Streamer-Sales

mlabonne/llm-datasets

gkamradt/LLMTest_NeedleInAHaystack

XueFuzhao/OpenMoE

FoundationVision/LlamaGen

RahulSChand/gpu_poor

pjlab-sys4nlp/llama-moe

THUDM/LongBench

Neph0s/awesome-llm-role-playing-with-persona

cognitivecomputations/laserRMT

morecry/CharacterEval

nrimsky/CAA

VITA-Group/LiGO

nick7nlp/Counting-Stars

Furyton/awesome-language-model-analysis

nancheng58/RecMamba

OpenDFM/ibsen

LRudL/sad

CURRENTF/MEFT

dourgey/qwen2_moe_mergekit