YashasSamaga

University of Washington, Seattle

YashasSamaga's Stars

codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
Language: Markdown 324k 5.6k 71130k
pbatard/rufus
The Reliable USB Formatting Utility
Language: C 29.8k 571 2.4k 2.6k
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language: Python 27.4k 274 8363k
onnx/onnx
Open standard for machine learning interoperability
Language: Python 18.2k 434 2.9k 3.7k
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Language: Python 15.7k 469 1.2k 3.5k
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
Language: TeX 14.2k 343 312.3k
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language: Python 8.6k 69 186356
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language: C++ 6k 110 1.2k 1k
Neargye/magic_enum
Static reflection for enums (to string, from string, iteration) for modern C++, work with any enum type without any macro or boilerplate code
Language: C++ 5.1k 67 224454
llvm-mirror/llvm
Project moved to: https://github.com/llvm/llvm-project
Language: LLVM 4.6k 283 0 2.1k
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language: Python 4.6k 82 244373
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language: C++ 3.7k 182 1.3k 1k
clab/dynet
DyNet: The Dynamic Neural Network Toolkit
Language: C++ 3.4k 182 935703
hanickadot/compile-time-regular-expressions
Compile Time Regular Expression in C++
Language: C++ 3.4k 65 240189
llvm-mirror/clang
Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project
Language: C++ 3k 198 0 1.7k
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
Language: Python 2.9k 33 42406
dendibakh/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Language: C++ 2.7k 112 40238
google/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Language: C 1.9k 54 230379
aantron/better-enums
C++ compile-time enum to string, iteration, in a single header file
Language: C++ 1.7k 56 91173
bloomberg/bde
Basic Development Environment - a set of foundational C++ libraries used at Bloomberg.
Language: C++ 1.7k 149 115319
novak-99/MLPP
A library created to revitalize C++ as a machine learning front end. Per aspera ad astra.
Language: C++ 1.1k 22 11155
NervanaSystems/maxas
Assembler for NVIDIA Maxwell architecture
Language: Sass 960 89 11164
MingSun-Tse/Efficient-Deep-Learning
Collection of recent methods on (deep) neural network compression and acceleration.
933 54 1132
cginternals/cmake-init
Template for reliable, cross-platform C++ project setup using cmake.
Language: CMake 913 40 55117
ikalnytskyi/termcolor
Termcolor is a header-only C++ library for printing colored messages to the terminal. Written just for fun with a help of the Force.
Language: C++ 844 27 26133
Machine-Learning-Tokyo/papers-with-annotations
Research papers with annotations, illustrations and explanations
829 84 275
NVIDIA/jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
Language: C++ 519 25 4864
NVIDIA/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language: C++ 483 15 6992
milakov/int_fastdiv
Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.
Language: Cuda 71 7 19
pdziepak/sopt
Language: C++ 8 3 0 0
