Pinned Repositories
EAGLE
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
leaderboard-backend
Open-source backend for Martian's LLM Inference Provider Leaderboard
mlc-ai-package
mlc-dev
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
octoml_relax
A fork of tlc-pack/relax
One-Shot-Learning-with-Siamese-Networks
Implementation of one-shot learning using convolutional Siamese networks on the Omniglot dataset
relax
Temporary repo for prototyping; the effort will be upstreamed
SRTuner
SRTuner is a Python library that provides efficient auto-tuning building blocks.
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
sunggg's Repositories
sunggg/SRTuner
SRTuner is a Python library that provides efficient auto-tuning building blocks.
sunggg/EAGLE
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
sunggg/leaderboard-backend
Open-source backend for Martian's LLM Inference Provider Leaderboard
sunggg/mlc-ai-package
sunggg/mlc-dev
sunggg/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
sunggg/octoml_relax
A fork of tlc-pack/relax
sunggg/One-Shot-Learning-with-Siamese-Networks
Implementation of one-shot learning using convolutional Siamese networks on the Omniglot dataset
sunggg/relax
Temporary repo for prototyping; the effort will be upstreamed
sunggg/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
sunggg/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
sunggg/web-llm
Bringing large language models and chat to web browsers. Everything runs inside the browser with no server support.