Pinned Repositories
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
coding-interview-guide
Coding Interview Guide for cxx.
leetcode
One leetcode a day keeps girlfriend away.
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
NaNAGISaSA's Repositories
NaNAGISaSA/leetcode
One leetcode a day keeps girlfriend away.
NaNAGISaSA/Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
NaNAGISaSA/coding-interview-guide
Coding Interview Guide for cxx.
NaNAGISaSA/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
NaNAGISaSA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
NaNAGISaSA/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators