dleunji's Stars
gongchenooo/WWW23-ODE
Code for [WWW'23] To Store or Not? Online Data Selection for Federated Learning with Limited Storage
conditionWang/FLNK
Federated Learning with New Knowledge -- explore to incorporate various new knowledge into existing FL systems and evolve these systems to reduce costs, extend their lifespan, and facilitate sustainable development.
NVIDIA/ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
likejazz/llama3.cuda
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.
modestyachts/evaluating_machine_accuracy_on_imagenet
microsoft/SparTA
UDC-GAC/venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
amazon-science/FeatGraph
twjiang/graphSAGE-pytorch
A PyTorch implementation of GraphSAGE. This package contains a PyTorch implementation of GraphSAGE.
Bruce-Lee-LY/cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
fan1997/HP-SpMM-SDDMM
Fast SpMM implementation on GPUs for GNN (IPDPS'23)
Shigangli/Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
apuaaChen/vectorSparse
pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
BUAA-CI-LAB/Literatures-on-GNN-Acceleration
A reading list for deep graph learning acceleration.
hgyhungry/ge-spmm
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
google-research/sputnik
A library of GPU kernels for sparse matrix operations.
sunlex0717/DissectingTensorCores
nullplay/Unified-Convolution-Framework
hwang2006/KISTI-DL-tutorial-using-horovod
efficient-ai-study/efficient-ai-study
google-parfait/tensorflow-federated
An open-source framework for machine learning and other computations on decentralized data.
mit-han-lab/tiny-training
On-Device Training Under 256KB Memory [NeurIPS'22]
SMILELab-FL/FedLab
A flexible Federated Learning Framework based on PyTorch, simplifying your Federated Learning research.
google-research/federated
A collection of Google research projects related to Federated Learning and Federated Analytics.
kakaobrain/trident
A performance library for machine learning applications.
vaseline555/Federated-Learning-in-PyTorch
Handy PyTorch implementation of Federated Learning (for your painless research)
innovation-cat/Awesome-Federated-Machine-Learning
Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond