ynych's Stars
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
triton-lang/triton
Development repository for the Triton language and compiler
qhjqhj00/MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
metame-ai/awesome-llm-plaza
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
AmadeusChan/Awesome-LLM-System-Papers
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Tencent/matrix
Matrix is a plugin style, non-invasive APM system developed by WeChat.
chatanywhere/GPT_API_free
Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
xiaomabenten/system_architect
💯2024年 系统架构设计师(软考高级)备考资源库+配套免费刷题软件。
sogou/workflow
C++ Parallel Computing and Asynchronous Networking Framework
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
VerticalResearchGroup/miaow
An open source GPU based off of the AMD Southern Islands ISA.
hughperkins/VeriGPU
OpenSource GPU, in Verilog, loosely based on RISC-V ISA
2noise/ChatTTS
A generative speech model for daily dialogue.
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
THU-BPM/MarkLLM
MarkLLM: An Open-Source Toolkit for LLM Watermarking.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
clu0/unet.cu
UNet diffusion model in pure CUDA