Pinned Repositories
FLASHNN
ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
FLASHNN
triton
Development repository for the Triton language and compiler
shengnxu's Repositories
shengnxu/ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
shengnxu/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
shengnxu/FLASHNN
shengnxu/triton
Development repository for the Triton language and compiler