anushkahebbar's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
mit-pdos/xv6-public
xv6 OS
tensorflow/tpu
Reference models and tools for Cloud TPUs.
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
spack/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
facebookresearch/dlrm
An implementation of a deep learning recommendation model (DLRM)
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
open-mpi/ompi
Open MPI main development repository
google-research/big_transfer
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
google/ml_collections
ML Collections is a library of Python Collections designed for ML use cases.
delimitrou/DeathStarBench
Open-source benchmark suite for cloud microservices
Alibaba-MIIL/ImageNet21K
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
intel/ai-reference-models
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs
facebookresearch/odin
A simple and effective method for detecting out-of-distribution images in neural networks.
Alibaba-MIIL/TResNet
Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)
Alibaba-MIIL/ML_Decoder
Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021)
intel/lkp-tests
Linux Kernel Performance tests
google/ghost-userspace
salesforce/hierarchicalContrastiveLearning
DonkeyShot21/cassle
Official repository for the paper "Self-Supervised Models are Continual Learners" (CVPR 2022)
mfleming/performance-resources
A collection of software performance content, blogs, books, and lists.
hpc/xpmem
Linux Cross-Memory Attach
amd/ZenDNN
amd/UIF
tyler-hayes/Deep_SLDA
PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"
rolandobrondolin/DEEP-mon
an eBPF-based monitoring tool to measure container resource usage, power consumption, network I/O, and file I/O
iVishalr/GEMM
Fast Matrix Multiplication Implementation in C programming language. This matrix multiplication algorithm is similar to what Numpy uses to compute dot products.
jodydadescott/firecracker-kernel
alex--m/xpmem
Linux Cross-Memory Attach