Pinned Repositories
ai_models
ArchMeasureBench
BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
CLBlast
Tuned OpenCL BLAS
flexible-gemm
flexible-gemm conv of deepcore
hpc_dev_docs
miCore
vim-setup
WorkTips
xingjinglu's Repositories
xingjinglu/ArchMeasureBench
xingjinglu/flexible-gemm
flexible-gemm conv of deepcore
xingjinglu/reference
Reference implementations of MLPerf benchmarks
xingjinglu/bit-rnn
Quantize weights and activations in Recurrent Neural Networks.
xingjinglu/bolt
Bolt is a deep learning framework with high performance and heterogeneous flexibility.
xingjinglu/cross-compile-tools
xingjinglu/CUDA-Winograd
Fast CUDA Kernels for ResNet Inference.
xingjinglu/Decoding-CUDA-Binary
xingjinglu/deepcore_source_code
Partial source code of deepcore v0.7
xingjinglu/DeepLearningExamples
Deep Learning Examples
xingjinglu/DeepPerf
DeepPerf is a set of CUDA assembly development tools
xingjinglu/Dive-into-DL-PyTorch
This project converts the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch implementations.
xingjinglu/ebook
classic books of computer science!
xingjinglu/ecosystem
Integration of TensorFlow with other open-source frameworks
xingjinglu/FeatherCNN
FeatherCNN is a high performance inference engine for convolutional neural networks.
xingjinglu/Freeflow
High performance container overlay networks on Linux. Enabling RDMA (on both InfiniBand and RoCE) and accelerating TCP to bare metal performance. Freeflow requires zero modification on application code/binary.
xingjinglu/gpuweb
Where the GPU for the Web work happens!
xingjinglu/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
xingjinglu/maxas
Assembler for NVIDIA Maxwell architecture
xingjinglu/models
Models and examples built with TensorFlow
xingjinglu/nmt
TensorFlow Neural Machine Translation Tutorial
xingjinglu/openFrameworks
openFrameworks is a community-developed cross platform toolkit for creative coding in C++.
xingjinglu/pdf-books
📚 PDF book library
xingjinglu/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
xingjinglu/pytorch-mobilenet
PyTorch MobileNet Implementation of "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications"
xingjinglu/pytorch-mobilenet-v3
MobileNetV3 in PyTorch and ImageNet pretrained models
xingjinglu/QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
xingjinglu/runtime
A performant and modular runtime for TensorFlow
xingjinglu/tensor2tensor
A library for generalized sequence to sequence models
xingjinglu/TNN
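TNN: a high-performance, lightweight inference framework for mobile devices, built by Tencent Youtu Lab. Its strengths include cross-platform support, high performance, model compression, and code pruning. Building on the Rapidnet and ncnn frameworks, TNN further strengthens support and performance optimization for mobile devices, and also draws on the high performance and good extensibility of mainstream open-source frameworks in the industry. TNN is already deployed in applications such as mobile QQ, Weishi, and Pitu. Contributions are welcome to help further improve the TNN inference framework.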