MonadKai

Beijing, China

MonadKai's Stars

baiguoname/qust
Language:Rust12325
SciPhi-AI/R2R
Containerized, state of the art Retrieval-Augmented Generation (RAG) system with a RESTful API
Language:Python3.7k281
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
Language:Python7.4k749
janhq/ichigo
Local realtime voice AI
Language:Python2k99
sgl-project/tensorrt-demo
TensorRT LLM Benchmark Configuration
Language:Python114
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Language:Python74539
usefulsensors/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Language:Python2.3k101
facebookresearch/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Language:Python3.6k521
backprop-ai/vllm-benchmark
Benchmarking the serving capabilities of vLLM
Language:Python225
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Language:Python4.3k220
DanielJDufour/language-detector
Detect the language of text
Language:Python3512
kamalkraj/stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tenorrt + tritonserver
Language:Jupyter Notebook12319
graspologic-org/graspologic-native
graspologic-native is a library of rust components to add additional capability to graspologic a python library for intelligently building networks and network embeddings, and for analyzing connected data.
Language:Rust106
graspologic-org/graspologic
Python package for graph statistics
Language:Python824143
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python7.5k935
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Language:Python9.9k1.2k
ZJU-ACES-ISE/chatunitest-maven-plugin
Language:Java5110
microsoft/Tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
Language:Python73693
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Language:Python16.4k1.7k
triton-inference-server/onnxruntime_backend
The Triton backend for the ONNX Runtime.
Language:C++13457
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language:Python14.4k899
shreyansh26/Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
Language:Python40
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
Language:Python2.5k160
mani-kantap/llm-inference-solutions
A collection of all available inference solutions for the LLMs
743
mobiusml/gemlite
Simple and fast low-bit matmul kernels in CUDA / Triton
Language:Python14711
allenai/wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
Language:Python19320
Cambricon/mlu-ops
Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .
Language:C++103104
fw-ai/benchmark
Benchmark suite for LLMs from Fireworks.ai
Language:Python6012
Deep-Learning-Profiling-Tools/triton-viz
Language:Python15514
triton-lang/kernels
Language:Python5017

MonadKai

MonadKai's Stars

baiguoname/qust

SciPhi-AI/R2R

explodinggradients/ragas

janhq/ichigo

sgl-project/tensorrt-demo

kvcache-ai/ktransformers

usefulsensors/moonshine

facebookresearch/ReAgent

backprop-ai/vllm-benchmark

facebookresearch/lingua

DanielJDufour/language-detector

kamalkraj/stable-diffusion-tritonserver

graspologic-org/graspologic-native

graspologic-org/graspologic

SWivid/F5-TTS

HKUDS/LightRAG

ZJU-ACES-ISE/chatunitest-maven-plugin

microsoft/Tutel

openai/swarm

triton-inference-server/onnxruntime_backend

VikParuchuri/surya

shreyansh26/Attention-Mask-Patterns

Lightning-AI/LitServe

mani-kantap/llm-inference-solutions

mobiusml/gemlite

allenai/wimbd

Cambricon/mlu-ops

fw-ai/benchmark

Deep-Learning-Profiling-Tools/triton-viz

triton-lang/kernels