Pinned Repositories
QUIK
Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
FDTD_2D
myBLAS
QUIK
Repository for the QUIK project, enabling the use of 4bit kernels for generative inference
torch_cmake_example
xcwang1999's Repositories
xcwang1999/myBLAS
xcwang1999/FDTD_2D
xcwang1999/core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
xcwang1999/QUIK
Repository for the QUIK project, enabling the use of 4bit kernels for generative inference
xcwang1999/torch_cmake_example