triton
There are 156 repositories under the triton topic.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
thu-ml/SageAttention
Quantized Attention achieves speedups of 2-5x and 3-11x compared to FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
TritonDataCenter/containerpilot
A service for autodiscovery and configuration of applications running in containers
JonathanSalwan/Tigress_protection
Playing with the Tigress software protection. Break some of its protections and solve their reverse engineering challenges. Automatic deobfuscation using symbolic execution, taint analysis and LLVM.
coderonion/awesome-llm-and-aigc
🚀🚀🚀 A collection of some awesome public projects about Large Language Model (LLM), Vision Language Model (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), the related Datasets and Applications.
FlagOpen/FlagGems
FlagGems is an operator library for large language models implemented in the Triton Language.
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
JafarAkhondali/acer-predator-turbo-and-rgb-keyboard-linux-module
Linux kernel module to support Turbo mode and RGB Keyboard for Acer Predator notebook series
rkinas/triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
d4em0n/exrop
Automatic ROPChain Generation
Colton1skees/Dna
LLVM based static binary analysis framework
opendilab/DI-hpc
OpenDILab RL HPC OP Lib, including CUDA and Triton kernels
coderonion/awesome-cuda-triton-hpc
🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR and High Performance Computing (HPC) projects.
SQLab/symgdb
SymGDB - symbolic execution plugin for gdb
kakaobrain/trident
A performance library for machine learning applications.
mmsaeed509/bspwm-dots
Ozoz dotfiles for bspwm, i3WM
NVIDIA-ISAAC-ROS/isaac_ros_object_detection
NVIDIA-accelerated, deep learned model support for image space object detection
clearml/clearml-serving
ClearML - Model-Serving Orchestration and Repository Solution
DeepAuto-AI/hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
novioleo/Savior
(WIP) A simple, lightweight, quickly integrated, pipelined deployment framework for algorithm services that ensures reliability, high concurrency, and scalability.
alphaSeclab/DBI-Stuff
Resources About Dynamic Binary Instrumentation and Dynamic Binary Analysis
alexzhang13/flashattention2-custom-mask
Triton implementation of FlashAttention2 that adds Custom Masks.
NVIDIA-ISAAC-ROS/isaac_ros_dnn_inference
NVIDIA-accelerated DNN model inference ROS 2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU
notAI-tech/fastDeploy
Deploy DL/ML inference pipelines with minimal extra code.
hyperai/triton-cn
Triton Documentation in Chinese Simplified / Triton 中文文档
triton/triton
Triton Operating System
WhiteeRabbit/Triton_RAT
🦎Triton_RAT is free and easy to use, one of the best remote administration tools written in Python, fully integrated with Telegram🦎
ergrelet/triton-bn
Binary Ninja plugin that can be used to apply Triton's dead store elimination pass on basic blocks or functions.
redis-developer/redis-nvidia-recsys
Three examples of recommendation system pipelines with NVIDIA Merlin and Redis
suvash/nixos-nvidia-cuda-python-docker-compose
A step-by-step guide to setting up Nvidia GPUs with CUDA support running on Docker (and Compose) containers on a NixOS host
MarineBioAcousticsRC/Triton
🐳 Scripps Whale Acoustics Lab 🌎 Scripps Acoustic Ecology Lab - Triton with remoras in development
kyegomez/EXA-1
An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!
Lallapallooza/fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
dame-cell/Triformer
Transformer components, but in Triton
mustakimur/COIN-Attacks
COIN Attacks: on Insecurity of Enclave Untrusted Interfaces in SGX - ASPLOS 2020