Pinned Repositories
ai_models
ArchMeasureBench
BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
CLBlast
Tuned OpenCL BLAS
flexible-gemm
flexible-gemm conv of deepcore
hpc_dev_docs
miCore
vim-setup
WorkTips
xingjinglu's Repositories
xingjinglu/CLBlast
Tuned OpenCL BLAS
xingjinglu/blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
xingjinglu/caffe
Caffe for Sparse and Low-rank Deep Neural Networks
xingjinglu/caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
xingjinglu/code-samples
Source code examples from the Parallel Forall Blog
xingjinglu/ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
xingjinglu/cutlass
CUDA Templates for Linear Algebra Subroutines
xingjinglu/Distributed-TensorFlow-Guide
Distributed TensorFlow basics and examples of training algorithms
xingjinglu/docs
xingjinglu/embedded_ai
xingjinglu/farm
Fast routines for the ARM processors
xingjinglu/gemmlowp
Low-precision matrix multiplication
xingjinglu/Halide
a language for image processing and computational photography
xingjinglu/Helium
Helium: Lifting High-Performance Stencil Kernels from Stripped x86 Binaries to Halide DSL Code
xingjinglu/k8s-rdma-device-plugin
RDMA device plugin for Kubernetes
xingjinglu/leetcode
LeetCode Problems' Solutions
xingjinglu/mobile-deep-learning
This research aims at simply deploying CNN(Convolutional Neural Network) on mobile devices, with low complexity and high speed.
xingjinglu/ModernWebPrograming
Web programing
xingjinglu/mshadow
Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
xingjinglu/notedown
Markdown <=> IPython Notebook
xingjinglu/NUMA-STREAM
The classic STREAM benchmark, extended to measure NUMA effects.
xingjinglu/openai-gemm
Open single and half precision gemm implementations
xingjinglu/ParaC-TestCase
xingjinglu/ParaFinder
xingjinglu/ParaFinder-Hash
xingjinglu/SkimCaffe
Caffe for Sparse Convolutional Neural Network
xingjinglu/SparseConvNet
Submanifold sparse convolutional networks
xingjinglu/SparseConvNet-1
Spatially-sparse convolutional networks. Allows processing of sparse 2, 3 and 4 dimensional data.Build CNNs on the square/cubic/hypercubic or triangular/tetrahedral/hyper-tetrahedral lattices.
xingjinglu/Tengine
Tengine is a lite, high performance, modular inference engine for embedded device
xingjinglu/veles.simd
Distributed machine learning platform