Pinned Repositories
RWKV-CUDA
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
taichi
Productive, portable, and performant GPU programming in Python.
FeatherCNN
FeatherCNN is a high performance inference engine for convolutional neural networks.
3x3_SVD_CUDA
Fast CUDA 3x3 SVD
cpufp
A CPU tool for benchmarking the peak of floating points
light-model-transformer
LSBDS
Large Scale Biology Database Search on Xeon Phi platform
SWhybrid
Taichi-MPI
The Taichi MPI demos with MPI4Py
test_feather_ncnn
The utility project to test computing results for FeatherCNN and ncnn
turbo0628's Repositories
turbo0628/Taichi-MPI
The Taichi MPI demos with MPI4Py
turbo0628/cpufp
A CPU tool for benchmarking the peak of floating points
turbo0628/blog_code
turbo0628/cluster_a3m
turbo0628/diff-gaussian-rasterization
turbo0628/docathon
turbo0628/esm
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
turbo0628/graphi-t
Handy tools & graphics API abstraction for blazing fast prototyping
turbo0628/jax-md
Differentiable, Hardware Accelerated, Molecular Dynamics
turbo0628/JD331
turbo0628/MAC-taichi
A MAC (Marker-And-Cell) solver written in Taichi
turbo0628/MetalBugReprod
Minimal reproduction of an Apple metal compilation bug.
turbo0628/mini-nbody
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
turbo0628/mpm_ptx_kernels
compare the mpm kernel performance
turbo0628/PFNN_TVM
Efficient PFNN implementations enabled by TVM
turbo0628/ppl.nn
A primitive library for neural network
turbo0628/prefix_sum_android
turbo0628/quaternion
A brief introduction to the quaternions and its applications in 3D geometry.
turbo0628/rhino3dm
Libraries based on OpenNURBS with a RhinoCommon style
turbo0628/rosetta-json-test
The json test suite for pyrosetta compilation
turbo0628/RWKV-CUDA
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
turbo0628/stable-fast
An ultra lightweight inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
turbo0628/taichi
Productive & portable high-performance programming in Python.
turbo0628/taichi-aot-demo
A demo illustrating how to use Taichi as an AOT shader compiler
turbo0628/taichi-benchmark
turbo0628/Taichi-UnityExample
turbo0628/taichi_benchmark
turbo0628/TaichiCloud
turbo0628/uVkCompute
A micro Vulkan compute pipeline and a collection of benchmarking compute shaders
turbo0628/vram_test