Pinned Repositories
KMenas-GPU
Implement K-Menas Clustering on GPUs
LLM-Quantization-Practice
Model quantization and inference.
TensorRT-Bert
Run Bert with TensorRT.
GGgary666's Repositories
GGgary666/Q-DiT
PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
GGgary666/KMenas-GPU
Implement K-Menas Clustering on GPUs
GGgary666/LLM-Quantization-Practice
Model quantization and inference.
GGgary666/TensorRT-Bert
Run Bert with TensorRT.