Pinned Repositories
onnxruntime
ONNX Runtime: a cross-platform, high-performance ML inferencing and training accelerator
TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
mmdeploy
OpenMMLab Model Deployment Framework
tensorrtllm_backend
The Triton TensorRT-LLM Backend
PyrateLimiter
⚔️ Python rate limiter using the leaky-bucket algorithm family
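The leaky-bucket family named in PyrateLimiter's description can be sketched in a few lines. The class below is a minimal, hypothetical illustration of the core idea (requests drain from the bucket at a fixed rate; a request is rejected when the bucket is full), not PyrateLimiter's actual API.

```python
import time


class LeakyBucket:
    """Minimal leaky-bucket rate limiter (illustrative sketch only).

    `capacity` is the maximum number of queued requests; `leak_rate`
    is how many requests drain per second. `clock` is injectable so
    the behavior can be tested deterministically.
    """

    def __init__(self, capacity, leak_rate, clock=time.monotonic):
        self.capacity = capacity
        self.leak_rate = leak_rate
        self.clock = clock
        self.level = 0.0          # current fill level of the bucket
        self.last = clock()       # timestamp of the last drain

    def allow(self):
        now = self.clock()
        # Drain the bucket for the elapsed time, never below empty.
        self.level = max(0.0, self.level - (now - self.last) * self.leak_rate)
        self.last = now
        if self.level + 1 <= self.capacity:
            self.level += 1       # admit the request
            return True
        return False              # bucket full: reject
```

With `capacity=2` and `leak_rate=1.0`, two back-to-back calls to `allow()` succeed, a third is rejected, and one more is admitted after a second has elapsed.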
onnxruntime_backend
The Triton backend for the ONNX Runtime.
aaditya-srivathsan's Repositories
aaditya-srivathsan doesn’t have any repositories yet.