CaoYiwei

NLP & CV

Win.dNanjing, Jiangsu, China

Pinned Repositories

Hello-World
Intro
0 0 00
TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
Language:Python0 0 00
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9.7k 119 2.3k1.2k
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python41.7k 349 7.2k6.3k

CaoYiwei's Repositories

CaoYiwei/Hello-World
Intro
0 0 00
CaoYiwei/TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
Language:Python0 0 00