sunstrikes's Stars
pytorch/torchtitan
A PyTorch native library for large model training
NVIDIA/cccl
CUDA Core Compute Libraries
baidu/babylon
High-Performance C++ Fundamental Library
facebookresearch/generative-recommenders
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
BBuf/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
KindXiaoming/pykan
Kolmogorov Arnold Networks
LazyVim/LazyVim
Neovim config for the lazy
karpathy/llm.c
LLM training in simple, raw C/CUDA
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
xuewujiao/Paddle
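PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)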
hongleizhang/RSPapers
RSTutorials: A Curated List of Must-read Papers on Recommender Systems.
NVIDIA/cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
intel/isa-l
Intelligent Storage Acceleration Library
DistPsArch/Paddle
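PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)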
romkatv/powerlevel10k
A Zsh theme
nakanomikuorg/arch-guide
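✨ A concise Arch Linux guide | This guide covers everything you may need, from Arch Linux installation, graphics drivers, and everyday software configuration to multimedia production and programming | Online documentation available ✨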
aa342138039/JD-SHOPPER
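JD.com automatic ordering (automatic login, reserving products at a specified time, restock monitoring, automatic add-to-cart, automatic order placement)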
xMuu/arch-kde-fontconfig
Arch Linux KDE Font Config
huaisha1224/jd-assistant
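JD.com flash-sale purchase assistant: includes login, querying product stock/price, adding to/clearing the shopping cart, snapping up products (placing orders), querying orders, querying the verification-code status of local life-service orders, and more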
AnthonyCalandra/modern-cpp-features
A cheatsheet of modern C++ language and library features.
liangjingkanji/DrakeTyporaTheme
Twelve theme styles - Material Google JetBrains Vue Juejin Purple Ayu Dark
danleifeng/Paddle
PArallel Distributed Deep LEarning (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment)
BBuf/tvm_mlir_learn
A collection of compiler learning resources.
sassman/t-rec-rs
Blazingly fast terminal recorder, written in Rust, that generates animated GIF images for the web
bytedance/byteps
A high performance and generic framework for distributed DNN training
LunarVim/LunarVim
🌙 LunarVim is an IDE layer for Neovim. Completely free and community driven.
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
triton-lang/triton
Development repository for the Triton language and compiler
open-mpi/ompi
Open MPI main development repository
BaguaSys/bagua
Bagua Speeds up PyTorch