de1star

Pinned Repositories

assign3
Language:JavaScript0 1 00
assignment5
Language:JavaScript0 1 00
assignment6
Language:JavaScript0 1 00
assignment7
Language:JavaScript0 1 00
de1star.github.io
Language:JavaScript0 1 00
former_face_figure
Language:Python0 1 00
msccl
Microsoft Collective Communication Library
Language:C++0 0 00
msccl
Microsoft Collective Communication Library
Language:C++327 12 2830
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9k 97 2.1k1k
TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Language:Python643 14 11247

de1star's Repositories

de1star/assign3
Language:JavaScript0 1 00
de1star/assignment5
Language:JavaScript0 1 00
de1star/assignment6
Language:JavaScript0 1 00
de1star/assignment7
Language:JavaScript0 1 00
de1star/de1star.github.io
Language:JavaScript0 1 00
de1star/former_face_figure
Language:Python0 1 00
de1star/msccl
Microsoft Collective Communication Library
Language:C++0 0 00