Pinned Repositories
facedetect
Detects one or more faces in the given image / video using NVIDIA Triton Inference Server
gfpgan
Face Restoration with a Generative Facial Prior
gtc-2023-SE52140
Developer Breakout - Accelerating Enterprise Workflows With Triton Server and DALI
kubeflow-servers
Customized Docker images that can run on Kubeflow.
Kubernetes-WireGuard-Server
Helm-based deployment for a WireGuard server on Kubernetes
layoutlmv3-triton-server
An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model
multi-node-k8s-ml
End-to-end deployment for multi-node training using GPU nodes on a Kubernetes cluster.
nvfuser
Testing the new, integrated compilers in PyTorch.
T5-TensorRT-LLM
T5 model on TensorRT-LLM & Triton Inference Server
triton-server-demo
A brief hands-on demo of how to use NVIDIA Triton Server for multiple models.
tuttlebr's Repositories
tuttlebr/layoutlmv3-triton-server
An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model
tuttlebr/multi-node-k8s-ml
End-to-end deployment for multi-node training using GPU nodes on a Kubernetes cluster.
tuttlebr/Kubernetes-WireGuard-Server
Helm-based deployment for a WireGuard server on Kubernetes
tuttlebr/T5-TensorRT-LLM
T5 model on TensorRT-LLM & Triton Inference Server
tuttlebr/facedetect
Detects one or more faces in the given image / video using NVIDIA Triton Inference Server
tuttlebr/gfpgan
Face Restoration with a Generative Facial Prior
tuttlebr/nvfuser
Testing the new, integrated compilers in PyTorch.
tuttlebr/triton-server-demo
A brief hands-on demo of how to use NVIDIA Triton Server for multiple models.
tuttlebr/gtc-2023-SE52140
Developer Breakout - Accelerating Enterprise Workflows With Triton Server and DALI
tuttlebr/nvidia-megatron-on-triton
An end-to-end framework for training and deploying LLMs with billions and trillions of parameters. This example uses the publicly available 20-billion-parameter GPT variant.
tuttlebr/riva-speech-skills
State-of-the-art models, fully accelerated pipelines, and tools to easily add Speech AI capabilities to real-time applications like virtual assistants, call center agent assist, and video conferencing.
tuttlebr/t5-faster-transformer
My working repo for following the NVIDIA blog post originally written by Denis Timonin, Bo Yang Hsueh, Dhruv Singal and Vinh Nguyen
tuttlebr/TimeSeriesPredictionPlatform
Helm port of NVIDIA's TimeSeriesPredictionPlatform
tuttlebr/apple-py
GPU-accelerated PyTorch training on Mac
tuttlebr/cluster-maintenance
General maintenance for my home servers using Ansible
tuttlebr/daedalus
Basement-Grade Kubernetes Cluster
tuttlebr/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
tuttlebr/DeepStream
tuttlebr/flair-on-nvidia-triton
Scripts to help deploy the Flair ner-english-fast model on Triton Server as a TorchScript model.
tuttlebr/generative-style-transfer
Generative style transfer with TensorFlow 2
tuttlebr/kubernetes-jupyterlab
A simple configuration for launching JupyterLab on Kubernetes.
tuttlebr/llm-continuous-batching-benchmarks
tuttlebr/nemo-supervised-fine-tuning
Supervised Fine-Tuning (SFT) is the process of fine-tuning all of the model's parameters on supervised data of inputs and outputs that teaches the model how to follow user-specified instructions.
tuttlebr/nv-pre-commit
Pre-commit hooks.
tuttlebr/quickperf
A simple performance collection tool for measuring the performance of a GPU-enabled environment.
tuttlebr/sleep_sounds
tuttlebr/steganography
A simple notebook demonstrating an image manipulation technique called steganography!
tuttlebr/terraform-oci-arch-postgresql
Terraform module to deploy PostgreSQL on Oracle Cloud Infrastructure (OCI).
tuttlebr/terraform-oci-arch-redis
terraform-oci-arch-redis
tuttlebr/tidbyt
Tidbyt apps for my own use. :)