workingloong's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
oceanbase/oceanbase
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
applenob/Cpp_Primer_Practice
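Get C++ done :punch:. A study repository for C++ Primer (Chinese edition, 5th edition), including notes and answers to the end-of-chapter exercises.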
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
openmlsys/openmlsys-zh
Machine Learning Systems: Design and Implementation (Chinese version)
apache/incubator-fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
codecov/codecov-action
GitHub Action that uploads coverage to Codecov :open_umbrella:
BrightXiaoHan/CMakeTutorial
A hands-on CMake tutorial in Chinese
TuGraph-family/tugraph-db
TuGraph is a high performance graph database.
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
DeepRec-AI/DeepRec
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted as an incubation project at the LF AI & Data Foundation.
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
jsksxs360/How-to-use-Transformers
A quick-start tutorial for the Transformers library
Tencent/PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
TuGraph-family/tugraph-analytics
TuGraph Analytics is a distributed graph compute engine.
codefuse-ai/MFTCoder
High-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.
tensorflow/recommenders-addons
Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.
tensorflow/custom-op
Guide for building a custom op for TensorFlow
intelligent-machine-learning/glake
GLake: optimizing GPU memory management and IO transmission.
codecov/example-python
Python coverage example
TUDB-Labs/multi-lora-fine-tune
Provides efficient LLM fine-tuning via multi-LoRA optimization
difizen/libro
libro: an easily customizable, flexibly integrated notebook product solution
CalvinXKY/BasicCUDA
A tutorial for CUDA & PyTorch
AliyunContainerService/et-operator
Kubernetes Operator for AI and Big Data Elastic Training
qiankunli/qiankunli.github.io