XinYao1994
Received a Ph.D. from HKU. Scalable learning systems, consistency & consensus, erasure coding, SOC & COC (NPU+) design, OS. Related collaborations are welcome!
HUST & HKU, Hong Kong
Pinned Repositories
9k8s
kubernetes cluster manage scripts used in HKU cluster
alphatensor
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
awesome-smartnic
A curated list of awesome smartnic tutorials, papers and projects.
FluentPS-paper
Based on PS-Lite
llm-paper-daily
Daily updated LLM papers. Subscriptions are welcome 👏 If you like it, give it a 🌟
plato
A new scalable federated learning research framework
sedna
AI toolkit over KubeEdge
TinyComputer
A simple architecture of computer, including CPU, VGA, Keyboard, and OS
Yui
Robot based on DL tech: TensorFlow, Keras, PyTorch and Fast.ai
XinYao1994's Repositories
XinYao1994/alphatensor
XinYao1994/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
XinYao1994/awesome-smartnic
A curated list of awesome smartnic tutorials, papers and projects.
XinYao1994/neural-mmo
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
XinYao1994/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
XinYao1994/ShieldStore
Trusted in-memory key-value store based on ShieldStore which is published in EuroSys 2019
XinYao1994/FluentPS-paper
Based on PS-Lite
XinYao1994/aili
The fastest concurrent in-memory index in the Eastern Hemisphere
XinYao1994/Artifact
XinYao1994/astra-sim
XinYao1994/awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.
XinYao1994/awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
XinYao1994/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
XinYao1994/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
XinYao1994/CodeGen
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
XinYao1994/dace
DaCe - Data Centric Parallel Programming
XinYao1994/dpdk
Data Plane Development Kit
XinYao1994/entangled
enTangle'd is an amalgamation of all things Tangle
XinYao1994/llm
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
XinYao1994/Megatron-LM
Ongoing research training transformer models at scale
XinYao1994/msccl
Microsoft Collective Communication Library
XinYao1994/OI-wiki
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic)
XinYao1994/Ok-Topk
Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with the decentralized parallel Stochastic Gradient Descent (SGD) optimizer, and its convergence is proved theoretically and empirically.
XinYao1994/OpenFOAM-dev
OpenFOAM Foundation development repository
XinYao1994/pslite
A lightweight parameter server interface
XinYao1994/RVC
Voice data <= 10 mins can also be used to train a good VC model!
XinYao1994/spark
Mirror of Apache Spark
XinYao1994/spdk
Storage Performance Development Kit
XinYao1994/tiny-training
On-Device Training Under 256KB Memory [NeurIPS'22]
XinYao1994/xyaocs
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes