Pinned Repositories
1DD-AVX_v3
Library of High Precision Sparse Matrix Operations Accelerated by SIMD
360WiFi-Linux
Linux version of 360 Portable WiFi (360随身WiFi)
A-Primer-on-Memory-Consistency-and-Cache-Coherence
A Primer on Memory Consistency and Cache Coherence (Second Edition) translation project
A64FX_SpMV_hands-on
SpMV hands-on exercise with SVE intrinsics (ACLE) for teaching
AES-ARM-NEON
Efficient implementation of masked AES on ARM NEON
amx
Apple AMX Instruction Set
ANT-Quantization
arch-arm64
PDPU
PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications
universal
Large collection of number systems providing custom arithmetic and mixed-precision algorithms for AI, Machine Learning, Computer Vision, Signal Processing, CAE, EDA, control, optimization, estimation, and approximation.
memory-paper's Repositories
memory-paper/universal
Large collection of number systems providing custom arithmetic and mixed-precision algorithms for AI, Machine Learning, Computer Vision, Signal Processing, CAE, EDA, control, optimization, estimation, and approximation.
memory-paper/A-Primer-on-Memory-Consistency-and-Cache-Coherence
A Primer on Memory Consistency and Cache Coherence (Second Edition) translation project
memory-paper/amx
Apple AMX Instruction Set
memory-paper/arm-spmv
Sparse matrix-vector multiplication optimized for ARM architecture
memory-paper/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
memory-paper/FDRA
DRA+RISC-V Exploration Framework
memory-paper/fpu-wrappers
Wrappers for open source FPU hardware implementations.
memory-paper/hlslpp
Math library using hlsl syntax with SSE/NEON support
memory-paper/hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
memory-paper/Intel-AVX512-Brief-Introduction
A brief introduction to Intel AVX-512
memory-paper/m4-sme-exploration
Exploring the scalable matrix extension of the Apple M4 processor
memory-paper/mambo
A low-overhead dynamic binary instrumentation and modification tool for ARM (both AArch32 and AArch64 support) and RISC-V (RV64GC).
memory-paper/my_verilog_projects
Digital IC projects for autumn campus recruitment, plus hand-written coding exercises
memory-paper/neon-guide
Makes ARM NEON documentation accessible (with examples)
memory-paper/novoverse
GEM5 Arm's N1 cores.
memory-paper/nudtbeamer
NUDT thesis proposal / graduation defense template
memory-paper/openmp-simd-examples
Exploring allowable uses of OpenMP SIMD
memory-paper/perf_event_tests
Test suite for the Linux perf_event subsystem
memory-paper/plotgen
Parse data and generate plotting scripts based on plotly.
memory-paper/pulpino
An open-source microcontroller system based on RISC-V
memory-paper/riscv-torture
RISC-V Torture Test
memory-paper/Rosko
Row-skipping outer product CPU kernels for sparse-dense matrix multiplication in Deep Neural Networks
memory-paper/Scaling-GEMM-on-the-ARM-Scalable-Vector-Matrix-Extensions
memory-paper/simde
Implementations of SIMD instruction sets for systems which don't natively support them.
memory-paper/SimEng
The University of Bristol HPC Simulation Engine
memory-paper/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
memory-paper/sme
memory-paper/sme-osx-arm64
memory-paper/TPDS
memory-paper/xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)