ShadenSmith
Technical Staff @ Microsoft AI. Passionate about high performance computing and machine learning.
@MicrosoftBellevue, Washington
ShadenSmith's Stars
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
huggingface/candle
Minimalist ML framework for Rust
phil-opp/blog_os
Writing an OS in Rust
githubnext/monaspace
An innovative superfamily of fonts for code
dmlc/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
state-spaces/mamba
Mamba SSM architecture
InkboxSoftware/excelCPU
16-bit CPU for Excel, and related files
dimforge/nalgebra
Linear algebra library for Rust.
plotters-rs/plotters
A rust drawing library for high quality data plotting for both WASM and native, statically and realtimely 🦀 📈🚀
sirupsen/napkin-math
Techniques and numbers for estimating system's performance from first-principles
0atman/noboilerplate
Code for my talks on the No Boilerplate channel
maestro-os/maestro
Unix-like kernel written in Rust
huggingface/safetensors
Simple, safe way to store and distribute tensors
wjakob/nanobind
nanobind: tiny and efficient C++/Python bindings
sarah-ek/faer-rs
Linear algebra foundation for the Rust programming language
sparsemat/sprs
sparse linear algebra library for rust
KaHIP/KaHIP
KaHIP -- Karlsruhe HIGH Quality Partitioning.
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
arduano/simdeez
easy simd
Dao-AILab/causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
microsoft/mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
microsoft/infinibatch
Efficient, check-pointed data loading for deep learning with massive data sets.
coreweave/tensorizer
Module, Model, and Tensor Serialization/Deserialization
yvt/amx-rs
Rust wrapper for Apple Matrix Coprocessor (AMX) instructions
SunDoge/dlpark
A Rust Library for High-Performance Tensor Exchange with Python
ankane/disco-rust
Recommendations for Rust using collaborative filtering
oliverhu/rama
llama2 inference engine in Rust
cjbattagl/GraSP
Distributed Streaming Graph Partitioning (C/MPI)
mortele/FastExp
Implementation of a fast(-ish) approximate exponential function for negative powers, based on polynomial approximations of different orders. Inspired by https://www.researchgate.net/publication/272178514_Fast_Exponential_Computation_on_SIMD_Architectures.
KarypisLab/BDMPI
Big Data Message Passing Interface