Pinned Repositories
accfft
A Massively Parallel FFT Library for CPU/GPU
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
AutoTest
awesome-machine-learning-cn
机器学习资源大全中文版,包括机器学习领域的框架、库以及软件
baidu-allreduce
EZLippi.github.io
这是我的个人网站的源码,欢迎fork。
fastmoe
A fast MoE impl for PyTorch
how-to-optimize-gemm
taco
The Tensor Algebra Compiler (taco) computes tensor expressions on sparse and dense tensors
limin2021's Repositories
limin2021/EZLippi.github.io
这是我的个人网站的源码,欢迎fork。
limin2021/mkl-dnn
Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN)
limin2021/awesome-machine-learning-cn
机器学习资源大全中文版,包括机器学习领域的框架、库以及软件
limin2021/Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
limin2021/bootstrap
The most popular HTML, CSS, and JavaScript framework for developing responsive, mobile first projects on the web.
limin2021/caffe
This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® Xeon processors (HSW+) and Intel® Xeon Phi processors
limin2021/cpufp
A CPU tool for benchmarking the peak of floating points
limin2021/cudnn-training
A CUDNN minimal deep learning training code sample using LeNet.
limin2021/cudpp
CUDA Data Parallel Primitives Library
limin2021/cwlseu.github.io
When you want to be a brilliant man, you should write down something interesting thing for recall.
limin2021/flann
Fast Library for Approximate Nearest Neighbors
limin2021/googletest
Google Test
limin2021/GPU-ArraySort
source code for GPU-ArraySort (same sized arrays)
limin2021/GPU-ArraySort-2.0
This version of GPU-ArraySort is capable of sorting large number of variable sized arrays, also includes some big fixes.
limin2021/gpu-rodinia
Rodinia benchmark
limin2021/kdtree
A simple C library for working with KD-Trees
limin2021/ks
GSKS (General Stride Kernel Summation)
limin2021/LightGBM
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
limin2021/mergesort_omp
limin2021/moderngpu
Patterns and behaviors for GPU computing
limin2021/NeuralNetworks
limin2021/nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
limin2021/opencv
Open Source Computer Vision Library
limin2021/OpenVML
Vector Math Library
limin2021/rabbitmq-tutorials
Tutorials for using RabbitMQ in various ways
limin2021/scikit-learn
scikit-learn: machine learning in Python
limin2021/scripts-ubuntu-debian
Scripts for Ubuntu/Debian
limin2021/simd_sort
limin2021/Tinyhttpd
tinyhttpd 是一个不到 500 行的超轻量型 Http Server,用来学习非常不错,可以帮助我们真正理解服务器程序的本质。
limin2021/word2vec
Word2Vec in C++ 11