codtiger's Stars
salykova/matmul.c
High-Performance FP32 Matrix Multiplication on CPU
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
shreyansh26/FlashAttention-PyTorch
Implementation of FlashAttention in PyTorch
gpu-mode/lectures
Material for gpu-mode lectures
imbue-ai/cluster-health
farhad-dalirani/StereoVision-SLAM
StereoVision-SLAM is a real-time visual stereo SLAM (Simultaneous Localization and Mapping)
slembcke/Tina
Tina is a teeny tiny, header only, coroutine and job library.
cabralpinto/modular-diffusion
Python library for designing and training your own Diffusion Models with PyTorch.
rania-hossam/LLAMA_FROM_SCRATCH_PYTORCH
TristanBilot/mlx-benchmark
Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
s-macke/VoxelSpace
Terrain rendering algorithm in less than 20 lines of code
lemire/simple_fastfloat_benchmark
fastfloat/fast_float
Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari
lemire/fast_float
Fast and exact implementation of the C++ from_chars functions for float and double types: 4x faster than strtod
ggerganov/ggml
Tensor library for machine learning
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
nothings/stb
stb single-file public domain libraries for C/C++
meta-llama/llama
Inference code for Llama models
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
cyrusbehr/tensorrt-cpp-api
TensorRT C++ API Tutorial
simdjson/simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
dariandzirko/Ferris-Stereopsis
Bevy application that will take images or a stream of images and display them to the user
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
ocornut/imgui
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
ggerganov/llama.cpp
LLM inference in C/C++
bkaradzic/bx
Base library used across multiple projects
dair-ai/ML-Papers-Explained
Explanation to key concepts in ML