sse2

There are 64 repositories under the sse2 topic.

simd-everywhere/simde
Implementations of SIMD instruction sets for systems which don't natively support them.
Language: C · 2.5k 52 414260
fastfloat/fast_float
Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari
Language: C++ · 1.7k 42 95144
bitshifter/glam-rs
A simple and fast linear algebra library for games and graphics
Language: Rust · 1.6k 17 236161
ada-url/ada
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram and Cloudflare Workers.
Language: C++ · 1.4k 23 12993
simdutf/simdutf
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.
Language: C++ · 1.3k 22 23580
jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Language: C++ · 1.1k 30 53361
powturbo/TurboPFor-Integer-Compression
Fastest Integer Compression
Language: C · 783 47 95112
shibatch/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Language: C · 683 34 193137
p-ranav/fccf
fccf: A command-line tool that quickly searches through C/C++ source code in a directory based on a search string and prints relevant code snippets that match the query.
Language: C++ · 363 7 1119
agenium-scale/nsimd
Agenium Scale vectorization library for CPUs and GPUs
Language: C · 329 26 6029
powturbo/Turbo-Run-Length-Encoding
TurboRLE: Fastest Run Length Encoding
Language: C · 285 15 1027
agenium-scale/boost.simd
Boost SIMD
232 32 15448
VectorChief/UniSIMD-assembler
SIMD macro assembler unified for ARM, MIPS, PPC and x86
Language: C · 88 13 18
swojtasiak/fcml-lib
A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).
Language: C · 87 8 1122
Fig1024/OP_RBF
Optimized Recursive Bilateral Filter
Language: C · 78 5 718
powturbo/Turbo-Histogram
Fastest Histogram Construction
Language: C · 68 13 27
jagger2048/fft_simd
A simple demo showing how to use SIMD (Single Instruction, Multiple Data) to optimize and accelerate the FFT algorithm.
Language: C++ · 33 1 16
Auburn/FastSIMD
Low level generic SIMD wrapper for x86, ARM, WASM with dynamic dispatch
Language: C++ · 32 7 25
opferman/SixtyFourBits
x64 Assembly Demo Framework
Language: Assembly · 26 2 013
unevens/hiir
A header-only, ready-to-include mirror of the HIIR library by Laurent De Soras, an oversampling and Hilbert transform library in C++, with additional support for double precision on ARM AArch64 using Neon.
Language: C++ · 26 3 11
VectorChief/QuadRay-engine
Realtime raytracer using SIMD on ARM, MIPS, PPC and x86
Language: C · 26 6 04
Technologicat/cython-sse-example
Simple example for embedding SSE2 assembly in Cython projects
Language: Python · 22 4 05
WOnder93/argon2
A multi-arch library implementing the Argon2 password hashing algorithm.
Language: C · 15 5 19
jonvaudio/simd_granodi
x64/SSE2 and AArch64/NEON SIMD layer in a single C/C++ header file, with functions/classes
Language: C++ · 13 1 00
AlexYaruki/iris
Software implementation of ARM and x86 SIMD intrinsics
Language: C++ · 12 3 12
zamronypj/oprsimd
Operator overloading for vector and matrix operations using Intel SIMD SSE/SSE2/SSE3 instructions, written in Free Pascal
Language: Pascal · 10 1 13
zbjornson/bson-to-json
Fast BSON to JSON string transcoder
Language: C++ · 10 4 34
rdbyk/balisc
A fresh (experimental) look at Scilab 6.x
Language: Scilab · 7 4 3591
6A1AC71C-60A7/Apolloclipse
X86-64 bilateral instruction tokenizer implemented in C. Supports the following processor extensions: AES, AVX, AVX2, AVX512, FMA, MMX, SSE, SSE2, SSE3, SSE4, x87 (FPU), VMX. To ease testing, a disassembler that transforms tokens into compilable assembly (for the NASM assembler) has been implemented.
Language: C · 6 1 60
turborium/SSE2Sample
Example of using SSE2
Language: Pascal · 6 2 0
Nemandza82/Symd
C++ header-only template library designed to make it easier to write high-performance SIMD (SSE, AVX, Neon) and multi-threaded code.
Language: C++ · 5 2 13
YuriMyakotin/ChaCha20-SIMD
ChaCha20 C SIMD implementations - AVX512, AVX2, SSE2
Language: C · 5 1 01
borisfoko/Matrix-Multiplication-SIMD-Intrinsics-and-FPU
NxN Matrix Multiplication using SIMD with Intrinsics (MMX, SSE, SSE2, AVX, etc.) and FPU as inline ASM in C
Language: C · 4 1 01
stevenhoving/yuvconvert
Library for optimized RGB to/from YUV conversions.
Language: C++ · 4 3 11
JohT/convolution-benchmarks
Benchmark convolution implementations in C++ with Catch2 visualized with Vega-Lite
Language: C++ · 3 1 11
magic3007/intel-simd
⚡ Leverage Intel vectorization techniques (MMX, SSE2 and AVX) to accelerate the conversion of YUV420 images into RGB images.
Language: C++ · 3 2 0