AmberJCJJ's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
microsoft/mup
maximal update parametrization (µP)
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
SymbioticLab/FedScale
FedScale is a scalable and extensible open-source federated learning (FL) platform.
Devinterview-io/llms-interview-questions
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
SymbioticLab/Oobleck
A resilient distributed training framework
AmberLJC/FLsystem-paper
Federated Learning Systems Paper List
mosharaf/cse585
Advanced Scalable Systems for X
zakuro-ai/sakura
Sakura is the ML library of the Zakuro framework. It provides asynchronous distributed training for Pytorch.
zdevito/single_controller
9Tempest/awesome-ML-heterogeneous-gpu-papers
This repo collects list of papers targeting support ML training/inference on heterogeneous gpu cluster, which is a less studied field