alexeybelkov
MIPT masters Applied Mathematics & Physics and Yandex School of Data Analysis alumni
IRA-LabsMoscow
alexeybelkov's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
bloomberg/memray
Memray is a memory profiler for Python
boostorg/boost
Super-project for modularized Boost
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
greg7mdp/parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
htqin/awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
agavrel/42_CheatSheet
A comprehensive guide to 50 years of evolution of strict C programming, a tribute to Dennis Ritchie's language
intel/x86-simd-sort
C++ template library for high performance SIMD based sorting algorithms
CoffeeBeforeArch/cuda_programming
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
taskflow/awesome-parallel-computing
A curated list of awesome parallel computing resources
m-schuetz/SimLOD
apolukhin/Boost-Cookbook
Online examples from "Boost C++ Application Development Cookbook":
HazyResearch/flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
google/aqt
NVIDIA/Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
pytorch/multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.
CoffeeBeforeArch/parallel_cpp
mgopshtein/cudacpp
C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.
dshah3/GPU-Puzzles
Solve puzzles. Learn CUDA.
hyhieu/easy_pybind
YashasSamaga/ConvolutionBuildingBlocks
GEMM and Winograd based convolutions using CUTLASS
mishgon/vox2vec
This repository is the official implementation of vox2vec: A Framework for Self-supervised Contrastive Learning of Voxel-level Representations in Medical Images
Talmaj/DNN-bench
A library that lets you easily increase efficiency of your deep learning models with no loss of accuracy.
oseledets/nla2023
Skoltech 2023 NLA course
andreacasalino/Fast-Quick-hull
Fast C++ multi-threaded algorithm for computing convex hulls
arseniybelkov/minasan
Telegram Bot to tag all the chat members
CD3/conan_cmake_cpp_project_tools
Tool for managing C++ project with standard tools.
diff7/EffDNets
Efficient Deep Learning models papers & resources
dvpsun/mipt2024s-5-modern-cv