Lzhang-hub

Pinned Repositories

Awesome-CV-Adversarial-Attack-List
This repository is a curated list of papers and open source code about competition for CV Adversarial Attack.
0 1 00
DeepLearningExamples
Deep Learning Examples
Language:Jupyter Notebook0 1 00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python0 0 00
dlrover
DLRover: An Automatic Distributed Deep Learning System
Language:Python0 0 00
elastic-gpu-exporter
A general-purpose GPU monitor, witch can monitor GPU cards and the usage of each pods or containers.
Language:Go0 1 00
FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++0 1 00
fastertransformer_backend_llama
Language:Python0 1 00
flash-attention
Fast and memory-efficient exact attention
Language:Python00
fuse-device-plugin
kubernetes device plugin for using /dev/fuse without privilege
Language:Go0 1 00
go-nvml
Language:C0 1 00

Lzhang-hub's Repositories

Lzhang-hub/Awesome-CV-Adversarial-Attack-List
This repository is a curated list of papers and open source code about competition for CV Adversarial Attack.
0 1 00
Lzhang-hub/DeepLearningExamples
Deep Learning Examples
Language:Jupyter Notebook0 1 00
Lzhang-hub/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python0 0 00
Lzhang-hub/dlrover
DLRover: An Automatic Distributed Deep Learning System
Language:Python0 0 00
Lzhang-hub/elastic-gpu-exporter
A general-purpose GPU monitor, witch can monitor GPU cards and the usage of each pods or containers.
Language:Go0 1 00
Lzhang-hub/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++0 1 00
Lzhang-hub/fastertransformer_backend_llama
Language:Python0 1 00
Lzhang-hub/flash-attention
Fast and memory-efficient exact attention
Language:Python00
Lzhang-hub/fuse-device-plugin
kubernetes device plugin for using /dev/fuse without privilege
Language:Go0 1 00
Lzhang-hub/go-nvml
Language:C0 1 00
Lzhang-hub/gpu-manager
Language:Go0 1 00
Lzhang-hub/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Language:Python0 1 00
Lzhang-hub/kube-prometheus
Use Prometheus to monitor Kubernetes and applications running on Kubernetes
Language:Jsonnet0 1 00
Lzhang-hub/kubernetes
Production-Grade Container Scheduling and Management
Language:Go0 1 00
Lzhang-hub/Lzhang-hub.github.io
个人博客，看效果进入
Language:CSS0 1 00
Lzhang-hub/ke-dlrover
ke version for dlrover
Language:Python1 0
Lzhang-hub/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Lzhang-hub/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Language:Python1 0
Lzhang-hub/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python0 0
Lzhang-hub/Megatron-LM
Ongoing research training transformer models at scale
Language:Python0 0
Lzhang-hub/nccl-tests
NVIDIA NCCL Tests for Distributed Training
Language:Shell0 0
Lzhang-hub/pod-gpu-memoty-monitor
Language:Go2 0
Lzhang-hub/pod-gpushare-metrics-exporter
Forked form
Language:Go1 0
Lzhang-hub/seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Language:HTML1 0
Lzhang-hub/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language:Python1 0
Lzhang-hub/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python0 0
Lzhang-hub/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python0 0
Lzhang-hub/useful-scripts
🐌 useful scripts for making developer's everyday life easier and happier, involved java, shell etc.
Language:Shell1 0
Lzhang-hub/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python1 0