Pinned Repositories
ec2-nvidia-driver
README for installing the NVIDIA driver on an AWS EC2 Ubuntu AMI
enc_dec_triton_trtllm
Example of serving TRT-LLM optimized encoder-decoder models like T5/BART through the Triton Python backend
fil_triton_sagemaker
fraud-notebooks
ft_pytorch_t5
Code to benchmark FT PyTorch T5
ft_t5_demo
ft_triton_t5
Code for benchmarking FT Triton T5 model with random weights
triton-mme-gtc23
triton-trt-oss
Example showing Triton hosting of TensorRT HuggingFace T5 and BART models
triton_trtllm_guide
Installation and usage guide for Triton TRT-LLM
kshitizgupta21's Repositories
kshitizgupta21/triton-trt-oss
Example showing Triton hosting of TensorRT HuggingFace T5 and BART models
kshitizgupta21/enc_dec_triton_trtllm
Example of serving TRT-LLM optimized encoder-decoder models like T5/BART through the Triton Python backend
kshitizgupta21/triton-mme-gtc23
kshitizgupta21/triton_trtllm_guide
Installation and usage guide for Triton TRT-LLM
kshitizgupta21/ec2-nvidia-driver
README for installing the NVIDIA driver on an AWS EC2 Ubuntu AMI
kshitizgupta21/fil_triton_sagemaker
kshitizgupta21/fraud-notebooks
kshitizgupta21/ft_pytorch_t5
Code to benchmark FT PyTorch T5
kshitizgupta21/ft_t5_demo
kshitizgupta21/ft_triton_t5
Code for benchmarking FT Triton T5 model with random weights
kshitizgupta21/GENTRL
Generative Tensorial Reinforcement Learning (GENTRL) model
kshitizgupta21/mme-gpu-blog
kshitizgupta21/moses
Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models
kshitizgupta21/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
kshitizgupta21/llama3_triton_trtllm
Llama-3 deployment on Triton TRT-LLM
kshitizgupta21/nim-deploy
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
kshitizgupta21/raft_benchmark
kshitizgupta21/transaction_graph_poc