sse2
There are 64 repositories under the sse2 topic.
simd-everywhere/simde
Implementations of SIMD instruction sets for systems which don't natively support them.
fastfloat/fast_float
Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari
bitshifter/glam-rs
A simple and fast linear algebra library for games and graphics
ada-url/ada
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram and Cloudflare Workers.
simdutf/simdutf
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.
jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
powturbo/TurboPFor-Integer-Compression
Fastest Integer Compression
shibatch/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
p-ranav/fccf
fccf: A command-line tool that quickly searches through C/C++ source code in a directory based on a search string and prints relevant code snippets that match the query.
agenium-scale/nsimd
Agenium Scale vectorization library for CPUs and GPUs
powturbo/Turbo-Run-Length-Encoding
TurboRLE: Fastest Run Length Encoding
agenium-scale/boost.simd
Boost SIMD
VectorChief/UniSIMD-assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
swojtasiak/fcml-lib
A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).
Fig1024/OP_RBF
Optimized Recursive Bilateral Filter
powturbo/Turbo-Histogram
Fastest Histogram Construction
jagger2048/fft_simd
A simple demo showing how to use SIMD (Single Instruction, Multiple Data) to optimize and accelerate the FFT algorithm.
Auburn/FastSIMD
Low level generic SIMD wrapper for x86, ARM, WASM with dynamic dispatch
opferman/SixtyFourBits
x64 Assembly Demo Framework
unevens/hiir
A header-only, ready-to-include mirror of the HIIR library by Laurent De Soras, an oversampling and Hilbert transform library in C++, with additional support for double precision on ARM AArch64 using Neon.
VectorChief/QuadRay-engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Technologicat/cython-sse-example
Simple example for embedding SSE2 assembly in Cython projects
WOnder93/argon2
A multi-arch library implementing the Argon2 password hashing algorithm.
jonvaudio/simd_granodi
x64/SSE2 and AArch64/NEON SIMD layer in a single C/C++ header file, with functions/classes
AlexYaruki/iris
Software implementation of ARM and x86 SIMD intrinsics
zamronypj/oprsimd
Operator overloading for vector and matrix operations using Intel SIMD SSE/SSE2/SSE3 instructions, written in Free Pascal
zbjornson/bson-to-json
Fast BSON to JSON string transcoder
rdbyk/balisc
A fresh (experimental) look at Scilab 6.x
6A1AC71C-60A7/Apolloclipse
X86-64 bilateral instruction tokenizer implemented in C. Supports the following processor extensions: AES, AVX, AVX2, AVX512, FMA, MMX, SSE, SSE2, SSE3, SSE4, x87 (FPU), VMX. To ease testing, a disassembler that transforms tokens into compilable assembly (for the NASM assembler) has been implemented.
turborium/SSE2Sample
Example of using SSE2
Nemandza82/Symd
C++ header-only template library designed to make it easier to write high-performance SIMD (SSE, AVX, Neon) and multi-threaded code.
YuriMyakotin/ChaCha20-SIMD
ChaCha20 C SIMD implementations - AVX512, AVX2, SSE2
borisfoko/Matrix-Multiplication-SIMD-Intrinsics-and-FPU
NxN Matrix Multiplication using SIMD with Intrinsics (MMX, SSE, SSE2, AVX, etc.) and FPU as inline ASM in C
stevenhoving/yuvconvert
Library for optimized RGB to/from YUV conversions.
JohT/convolution-benchmarks
Benchmark convolution implementations in C++ with Catch2 visualized with Vega-Lite
magic3007/intel-simd
⚡ Leverage the Intel vectorization techniques MMX, SSE2 and AVX to accelerate the conversion of YUV420 images into RGB images.