CharlieFRuan's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
meta-llama/llama3
The official Meta Llama 3 GitHub site
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
modularml/mojo
The Mojo Programming Language
openai/openai-python
The official Python library for the OpenAI API
ml-explore/mlx
MLX: An array framework for Apple silicon
triton-lang/triton
Development repository for the Triton language and compiler
langchain-ai/langchainjs
🦜🔗 Build context-aware reasoning applications 🦜🔗
xenova/transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
jacoblee93/fully-local-pdf-chatbot
Yes, it's another chat over documents implementation... but this one is entirely local!
huggingface/huggingface.js
Utilities to use the Hugging Face Hub API
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
j2kun/mlir-tutorial
MLIR For Beginners tutorial
mirage-project/mirage
A multi-level tensor algebra superoptimizer
pmodels/mpich
Official MPICH Repository
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
Cornell-RelaxML/quip-sharp
HazyResearch/aisys-building-blocks
Building blocks for foundation models.
mlc-ai/web-llm-chat
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
KnowingNothing/MatmulTutorial
An Easy-to-understand TensorOp Matmul Tutorial
mlc-ai/tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
mlc-ai/mlc-assistant
Chat with your documents and improve your writing using large-language models within your browser.
Jokeren/triton-samples
mlc-ai/package
GarlGuo/CD-GraB
CD-GraB is a distributed gradient balancing framework that aims to find distributed data permutations with provably better convergence guarantees than Distributed Random Reshuffling (D-RR). https://arxiv.org/pdf/2302.00845.pdf