Motsepe-Jr's Stars
jonhoo/orst
Sorting algorithms in Rust
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
davidmrau/mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
hydrallm/llama-moe-v1
gordicaleksa/Open-NLLB
Effort to open-source NLLB checkpoints.
Motsepe-Jr/FastApi_Model_Serving
Odevio/Odevio-CLI
Odevio, the easy way to setup, build & release flutter apps for iOS on Linux, Windows and MacOS
pypyr/pypyr
pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.
koayon/awesome-adaptive-computation
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
srush/llama2.rs
A fast llama2 decoder in pure Rust.
huggingface/candle
Minimalist ML framework for Rust
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
conceptofmind/PaLM
An open-source implementation of Google's PaLM models
srush/LLM-Training-Puzzles
What would you do with 1000 H100s...
Shaunwei/RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
karpathy/llama2.c
Inference Llama 2 in one file of pure C
sonos/tract
Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference
eth-sri/lmql
A language for constraint-guided and efficient LLM programming.
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Noeda/rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
KerfuffleV2/smolrsrwkv
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit evaluation. It can also directly load PyTorch RWKV models.
modularml/mojo
The Mojo Programming Language
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.