Pinned Repositories
AutoGrad_CPP
Autograd can automatically differentiate C++ code
cutlass
CUDA Templates for Linear Algebra Subroutines
documents
llvm-doc
MAI
MAI is a neural network inference engine
MemoryArena
MemoryArena is used to automatically manage memory allocation and deallocation.
mvm
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Tools
gavinchen430's Repositories
gavinchen430/AutoGrad_CPP
Autograd can automatically differentiate C++ code
gavinchen430/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
gavinchen430/cutlass
CUDA Templates for Linear Algebra Subroutines
gavinchen430/documents
gavinchen430/llvm-doc
gavinchen430/MAI
MAI is a neural network inference engine
gavinchen430/MemoryArena
MemoryArena is used to automatically manage memory allocation and deallocation.
gavinchen430/mvm
gavinchen430/Tools