Pinned Repositories
Activations
A list of current activation functions in deep learning.
admiralty
A system of Kubernetes controllers that intelligently schedules workloads across clusters.
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
apisix
The Cloud-Native API Gateway
argo-workflows
Workflow Engine for Kubernetes
arktos
Arktos for large-scale cloud platform
kong
:monkey: The Microservice API Gateway
raft
Golang implementation of the Raft consensus protocol
tensorflow
Computation using data flow graphs for scalable machine learning
zstd
Zstandard - Fast real-time compression algorithm
smile-luobin's Repositories
smile-luobin/admiralty
A system of Kubernetes controllers that intelligently schedules workloads across clusters.
smile-luobin/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
smile-luobin/apisix
The Cloud-Native API Gateway
smile-luobin/argo-workflows
Workflow Engine for Kubernetes
smile-luobin/arktos
Arktos for large-scale cloud platform
smile-luobin/caliper
A blockchain benchmark framework to measure performance of multiple blockchain solutions
smile-luobin/kong
:monkey: The Microservice API Gateway
smile-luobin/raft
Golang implementation of the Raft consensus protocol
smile-luobin/tensorflow
Computation using data flow graphs for scalable machine learning
smile-luobin/curve
Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file storage.
smile-luobin/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
smile-luobin/deploy
Deploy Development Builds of Open Cluster Management (OCM) on RedHat Openshift Container Platform
smile-luobin/flash-attention
Fast and memory-efficient exact attention
smile-luobin/flashinfer
FlashInfer: Kernel Library for LLM Serving
smile-luobin/Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
smile-luobin/k1s
The world's simplest Kubernetes dashboard (50 lines of Bash code)
smile-luobin/kata-containers
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workload isolation and security advantages of VMs. https://katacontainers.io/
smile-luobin/kubernetes
Production-Grade Container Scheduling and Management
smile-luobin/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
smile-luobin/Megatron-LM
Ongoing research training transformer models at scale
smile-luobin/nccl
Optimized primitives for collective multi-GPU communication
smile-luobin/patterns-of-distributed-systems
《Patterns of Distributed Systems》中文版
smile-luobin/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
smile-luobin/rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files
smile-luobin/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
smile-luobin/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
smile-luobin/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
smile-luobin/volcano
A Cloud Native Batch System (Project under CNCF)
smile-luobin/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
smile-luobin/zadig-portal
The zadig web component