Pinned Repositories
ARM_NEON_2_x86_SSE
asmjit
Machine code generation for C++
AutoKernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
AvxToNeon
In this project, the frequently used AVX instructions are encapsulated as independent modules to reduce repeated development workload.
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
awesome-wasm-zh
WebAssembly(wasm)资源精选
beam
✨ Expressive WebGL
bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
Book-DISO-WebAssembly
A book related repository which name is 《深入浅出 WebAssembly》
onnx2caffe
convert onnx to caffemodel
seanxcwang's Repositories
seanxcwang/onnx2caffe
convert onnx to caffemodel
seanxcwang/ARM_NEON_2_x86_SSE
seanxcwang/asmjit
Machine code generation for C++
seanxcwang/AutoKernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
seanxcwang/AvxToNeon
In this project, the frequently used AVX instructions are encapsulated as independent modules to reduce repeated development workload.
seanxcwang/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
seanxcwang/awesome-wasm-zh
WebAssembly(wasm)资源精选
seanxcwang/beam
✨ Expressive WebGL
seanxcwang/bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
seanxcwang/coost
A tiny boost library in C++11.
seanxcwang/cppwasm-book
:books: WebAssembly friendly programming with C/C++ -- Emscripten practice
seanxcwang/DepthwiseConvolution
A personal depthwise convolution layer implementation on caffe by liuhao.(only GPU)
seanxcwang/EasyQuant
EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activations.
seanxcwang/Khronosdotorg
Website resource pages for Khronos.org. Community is encouraged to help keep up-to-date
seanxcwang/MegPeak
seanxcwang/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
seanxcwang/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
seanxcwang/nnieqat-pytorch
A nnie quantization aware training tool on pytorch.
seanxcwang/paper-reading
深度学习经典、新论文逐段精读
seanxcwang/sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
seanxcwang/taskflow
A General-purpose Parallel and Heterogeneous Task Programming System
seanxcwang/tech-blog
我的个人技术博客(Python、Django、Docker、Go、Redis、ElasticSearch、Kafka、Linux)
seanxcwang/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
seanxcwang/TfliteRead
An Open Source Machine Learning Framework for Everyone
seanxcwang/TLCBench
Benchmark scripts for TVM
seanxcwang/TNN
TNN:移动端高性能、轻量级推断框架,同时拥有跨平台、高性能、模型压缩、代码裁剪等众多突出优势。TNN框架在原有Rapidnet、ncnn框架的基础上进一步加强了移动端设备的支持以及性能优化,同时也借鉴了业界主流开源框架高性能和良好拓展性的优点。目前TNN已经在手Q、微视、P图等应用中落地,欢迎大家参与协同共建,促进TNN推断框架进一步完善。
seanxcwang/TRTorch
PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
seanxcwang/version2
Vector class library, latest version
seanxcwang/webgl-examples
Code examples that accompany the MDN WebGL documentation
seanxcwang/xbyak
a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header