workingloong's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
badges/shields
Concise, consistent, and legible badges in SVG and raster format
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
applenob/Cpp_Primer_Practice
搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
apache/fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
codecov/codecov-action
GitHub Action that uploads coverage to Codecov :open_umbrella:
TuGraph-family/tugraph-db
TuGraph: A High Performance Graph Database.
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
jsksxs360/How-to-use-Transformers
Transformers 库快速入门教程
DeepRec-AI/DeepRec
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Tencent/PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
codefuse-ai/MFTCoder
High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.
TuGraph-family/tugraph-analytics
GeaFlow: A Streaming Graph Compute Engine.
intelligent-machine-learning/glake
GLake: optimizing GPU memory management and IO transmission.
tensorflow/custom-op
Guide for building custom op for TensorFlow
difizen/libro
A Notebook with Flexible Customization and Easy Integration.
codecov/example-python
Python coverage example
TUDB-Labs/mLoRA
An Efficient "Factory" to Build Multiple LoRA Adapters
CalvinXKY/BasicCUDA
A tutorial for CUDA&PyTorch
AliyunContainerService/et-operator
Kubernetes Operator for AI and Bigdata Elastic Training
qiankunli/qiankunli.github.io
greengerong/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning