Pinned Repositories
auto-dev
π§βAutoDev: The AI-powered coding wizard with multilingual support π, auto code generation ποΈ, and a helpful bug-slaying assistant π! Customizable prompts π¨ and a magic Auto Testing feature π§ͺ included! π
automl-dsge
AutoML SGE
cake
Distributed LLM inference for mobile, desktop and server.
CodeGPT
JetBrains extension providing access to state-of-the-art LLMs, such as GPT-4, Code Llama, and others, all for free
cora
devpilot-intellij
Your new coding buddy, designed exclusively for IntelliJ IDEA.
DistServe
Disaggregated serving system for Large Language Models (LLMs).
docs
leetcode-javascript
llm-ls
LSP server leveraging LLMs for code completion (and more?)
MonadKai's Repositories
MonadKai/auto-dev
π§βAutoDev: The AI-powered coding wizard with multilingual support π, auto code generation ποΈ, and a helpful bug-slaying assistant π! Customizable prompts π¨ and a magic Auto Testing feature π§ͺ included! π
MonadKai/cake
Distributed LLM inference for mobile, desktop and server.
MonadKai/CodeGPT
JetBrains extension providing access to state-of-the-art LLMs, such as GPT-4, Code Llama, and others, all for free
MonadKai/devpilot-intellij
Your new coding buddy, designed exclusively for IntelliJ IDEA.
MonadKai/DistServe
Disaggregated serving system for Large Language Models (LLMs).
MonadKai/docs
MonadKai/leetcode-javascript
MonadKai/llm-ls
LSP server leveraging LLMs for code completion (and more?)
MonadKai/martian
MonadKai/mojo
The Mojo Programming Language
MonadKai/MonadKai
About me
MonadKai/MonadKai.github.io
MonadKai/paillier-benchmark
Benchmark between different paillier encryption implemenetations
MonadKai/projecteuler-rust
Rust solution for project euler
MonadKai/audio-preprocess
Preprocess Audio for training
MonadKai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
MonadKai/HAMi
Heterogeneous AI Computing Virtualization Middleware
MonadKai/mosec
A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
MonadKai/SWE-agent
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models
MonadKai/SwiftTransformer
High performance Transformer implementation in C++.
MonadKai/text-generation-inference
Large Language Model Text Generation Inference
MonadKai/triton
Development repository for the Triton language and compiler
MonadKai/triton-viz
MonadKai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
MonadKai/verl
veRL: Volcano Engine Reinforcement Learning for LLM
MonadKai/vllm-dcu
MonadKai/vllm_musa
A high-throughput and memory-efficient inference and serving engine for LLMs
MonadKai/volcano-vgpu-device-plugin
Device-plugin for volcano vgpu which support hard resource isolation
MonadKai/xLLM
A lightweight llama2 inference framework. It can inference llama2-7b with 5000+ tokens/s on signle 4090.
MonadKai/ZLUDA
CUDA on ??? GPUs