NaNAGISaSA

China

Pinned Repositories

tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python11.9k 379 3.4k3.5k
Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
Language:C++0 0 00
coding-interview-guide
Coding Interview Guide for cxx.
Language:C++0 1 00
leetcode
One leetcode a day keeps girlfriend away.
Language:C++2 0 00
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language:C++0 0 00
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++0 0 00
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python0 0 00
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9.1k 97 2.1k1.1k
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language:C++3.7k 182 1.3k1k
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python33.5k 274 5.9k5.1k

NaNAGISaSA/leetcode
One leetcode a day keeps girlfriend away.
Language:C++2 0 00
NaNAGISaSA/Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
Language:C++0 0 00
NaNAGISaSA/coding-interview-guide
Coding Interview Guide for cxx.
Language:C++0 1 00
NaNAGISaSA/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
Language:C++0 0 00
NaNAGISaSA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++0 0 00
NaNAGISaSA/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python0 0 00