swordfate's Stars
kanadeblisst00/high-quality-biz
我关注的一些优质公众号,基本都是js逆向和安卓逆向方面
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
stackblitz/bolt.new
Prompt, run, edit, and deploy full-stack web applications
voideditor/void
onejune2018/Awesome-LLM-Eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
nyu-mll/jiant
jiant is an nlp toolkit
FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
RubyMetric/chsrc
chsrc 全平台通用换源工具与框架. Change Source everywhere for every software
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
yale-sys/prompt-cache
Modular and structured prompt caching for low-latency LLM inference
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
InternLM/AcmeTrace
iarai/concurrent-dataloader
Profiling and Improving the PyTorch Dataloader for high-latency Storage
UbiquitousLearning/Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
ISCS-ZJU/NVAlloc
Source code for NVAlloc-ASPLOS'22
FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
sashabaranov/go-openai
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
inducer/pymetis
A Python wrapper around Metis, a graph partitioning package
farkhor/PaRMAT
Multi-threaded Large-Scale RMAT Graph Generator.
zhiqi-0/PaGraph
SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.