Pinned Repositories
Awesome-CV-Adversarial-Attack-List
A curated list of papers and open-source code for CV adversarial attack competitions.
DeepLearningExamples
Deep Learning Examples
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dlrover
DLRover: An Automatic Distributed Deep Learning System
elastic-gpu-exporter
A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.
FasterTransformer
Transformer-related optimization, including BERT and GPT
fastertransformer_backend_llama
flash-attention
Fast and memory-efficient exact attention
fuse-device-plugin
Kubernetes device plugin for using /dev/fuse without privilege
go-nvml
Lzhang-hub's Repositories
Lzhang-hub/Awesome-CV-Adversarial-Attack-List
A curated list of papers and open-source code for CV adversarial attack competitions.
Lzhang-hub/DeepLearningExamples
Deep Learning Examples
Lzhang-hub/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Lzhang-hub/dlrover
DLRover: An Automatic Distributed Deep Learning System
Lzhang-hub/elastic-gpu-exporter
A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.
Lzhang-hub/FasterTransformer
Transformer-related optimization, including BERT and GPT
Lzhang-hub/fastertransformer_backend_llama
Lzhang-hub/flash-attention
Fast and memory-efficient exact attention
Lzhang-hub/fuse-device-plugin
Kubernetes device plugin for using /dev/fuse without privilege
Lzhang-hub/go-nvml
Lzhang-hub/gpu-manager
Lzhang-hub/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Lzhang-hub/kube-prometheus
Use Prometheus to monitor Kubernetes and applications running on Kubernetes
Lzhang-hub/kubernetes
Production-Grade Container Scheduling and Management
Lzhang-hub/Lzhang-hub.github.io
Personal blog; visit the site to see it in action
Lzhang-hub/ke-dlrover
ke version for dlrover
Lzhang-hub/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Lzhang-hub/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Lzhang-hub/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Lzhang-hub/Megatron-LM
Ongoing research training transformer models at scale
Lzhang-hub/nccl-tests
NVIDIA NCCL Tests for Distributed Training
Lzhang-hub/pod-gpu-memoty-monitor
Lzhang-hub/pod-gpushare-metrics-exporter
Forked from
Lzhang-hub/seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Lzhang-hub/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Lzhang-hub/sglang
SGLang is a fast serving framework for large language models and vision language models.
Lzhang-hub/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Lzhang-hub/useful-scripts
🐌 Useful scripts for making developers' everyday life easier and happier, covering Java, shell, etc.
Lzhang-hub/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs