jybbjybb

Pinned Repositories

cnn6-mnist-noisy-signaling
Language:Python00
DCTNet
Language:Python00
prune_utils
Language:Python10
pytorch-cifar
Language:Python00
Transformer_GaP
Language:Python00
codellama
Inference code for CodeLlama models
Language:Python15.9k 182 1941.8k
hnswlib
Header-only C++/python library for fast approximate nearest neighbors
Language:C++4.3k 64 368633
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.3k 89 1.8k930

jybbjybb's Repositories

jybbjybb/prune_utils
Language:Python10
jybbjybb/cnn6-mnist-noisy-signaling
Language:Python00
jybbjybb/DCTNet
Language:Python00
jybbjybb/pytorch-cifar
Language:Python00
jybbjybb/Transformer_GaP
Language:Python00