ethan-digi

Pinned Repositories

TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.6k 93 1.9k978
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language:Python8.3k 144 3.8k1.5k
tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language:Python703 24 491104
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python4.9k 76 197413

ethan-digi doesn’t have any repository yet.