tsukanov-as's Stars
practical-tutorials/project-based-learning
Curated list of project-based tutorials
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
ByteByteGoHq/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
google-research/bert
TensorFlow code and pre-trained models for BERT
microsoft/autogen
A programming framework for agentic AI 🤖
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
reflex-dev/reflex
🕸️ Web apps in pure Python 🐍
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
ml-explore/mlx
MLX: An array framework for Apple silicon
githubnext/monaspace
An innovative superfamily of fonts for code
ml-explore/mlx-examples
Examples in the MLX framework
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
pywebio/PyWebIO
Write interactive web app in script way.
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
krishnaik06/Roadmap-To-Learn-Generative-AI-In-2024
flowtyone/flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
maypok86/otter
A high performance cache for Go
intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
XiongjieDai/GPU-Benchmarks-on-LLM-Inference
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
okuvshynov/slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
lmg-anon/mikupad
LLM Frontend in a single html file
lvc/pkgdiff
A tool for visualizing changes in Linux software packages
nhatthm/otelsql
OpenTelemetry SQL database driver wrapper for Go
ejones/llama-journey
Experimental adventure game with AI-generated content
Bithack/go-hnsw