Pinned Repositories
asplos23-tutorial
DeepView.Explore
🛠 VSCode plugin that provides visual interface for CentML Tools
DeepView.Predict
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
flexible-inference-bench
A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
gpu-usage-estimator
Python script to estimate GPU utilization using NVIDIA Nsight Systems
llm-inference-bench
Lightweight and extensible LLM Inference serving benchmark tool written in Rust.
Sylva
Boost fine-tuning performance with sparse embedded adapters and hierarchical approximate second-order information.
TMLS2022
Artifacts presented at the TMLS 2022 Workshop
VectorWorkshop
CentML's Repositories
CentML/DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
CentML/DeepView.Predict
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
CentML/DeepView.Explore
🛠 VSCode plugin that provides visual interface for CentML Tools
CentML/VectorWorkshop
CentML/flexible-inference-bench
A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
CentML/gpu-usage-estimator
Python script to estimate GPU utilization using NVIDIA Nsight Systems
CentML/asplos23-tutorial
CentML/llm-inference-bench
Lightweight and extensible LLM Inference serving benchmark tool written in Rust.
CentML/TMLS2022
Artifacts presented at the TMLS 2022 Workshop
CentML/build-pytorch-from-source
Build PyTorch from source
CentML/centml-python-client
CentML/Sylva
Boost fine-tuning performance with sparse embedded adapters and hierarchical approximate second-order information.
CentML/tmls-workshop-2023
Repository containing the necessary files for TMLS 2023 demo
CentML/coreweave-app
CentML/Ax
Adaptive Experimentation Platform
CentML/ConvNeXt
Code release for ConvNeXt model
CentML/cortex
Production infrastructure for machine learning at scale
CentML/cserve-client
CServe client library
CentML/ecr-anywhere
Pull from private ECR repos... anywhere
CentML/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
CentML/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
CentML/hfta
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
CentML/platform_api_python_client
CentML/pulumi
Pulumi - Infrastructure as Code in any programming language. Build infrastructure intuitively on any cloud using familiar languages 🚀
CentML/simple-sidecar
A simple configurable kubernetes sidecar injector.
CentML/training-operator
Distributed ML Training and Fine-Tuning on Kubernetes
CentML/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.