chiendb97

Pinned Repositories

CP
Language:C++00
cuda-practice
Language:Cuda0 1 00
cutlass-practice
Language:Cuda1 1 00
FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++0 0 00
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language:Cuda00
naive_bayes
Language:Python0 1 00
NeMo
NeMo: a toolkit for conversational AI
Language:Python00
nvidia-modelopt
Language:Python0 1 01
shopee_data_science
Language:Python0 2 00
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++0 0 00

chiendb97's Repositories

chiendb97/cutlass-practice
Language:Cuda1 1 00
chiendb97/CP
Language:C++00
chiendb97/cuda-practice
Language:Cuda0 1 00
chiendb97/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++0 0 00
chiendb97/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language:Cuda00
chiendb97/naive_bayes
Language:Python0 1 00
chiendb97/NeMo
NeMo: a toolkit for conversational AI
Language:Python00
chiendb97/nvidia-modelopt
Language:Python0 1 01
chiendb97/shopee_data_science
Language:Python0 2 00
chiendb97/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++0 0 00
chiendb97/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python0 0 00
chiendb97/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0 00
chiendb97/translation
Language:Python0 2 00
chiendb97/wenet-openfst-android
Language:C++0 1 00
chiendb97/tensorrt_backend
The Triton backend for TensorRT.
Language:C++0 0
chiendb97/ultralytics
Ultralytics YOLO11 🚀