bfloat16
There are 20 repositories under bfloat16 topic.
uxlfoundation/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
VoidStarKat/half-rs
Half-precision floating point types f16 and bf16 for Rust.
JuliaMath/BFloat16s.jl
Julia implementation for the BFloat16 number type
higham/chop
Round matrix elements to lower precision in MATLAB
shibatch/tlfloat
C++ template library for floating point operations
DW0RKiN/Floating-point-Library-for-Z80
Floating-Point Arithmetic Library for Z80
nestordemeure/jochastic
A JAX implementation of stochastic addition.
aahouzi/llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
afterdusk/flop
IEEE 754-style floating-point converter
KernelTuner/kernel_float
CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development
d4l3k/go-bfloat16
Bfloat16 conversion utilities for Go/Golang
nestordemeure/stochastorch
A Pytorch implementation of stochastic addition.
imciner2/ChopBLAS
Basic linear algebra routines implemented using the chop rounding function
puzzlef/vector-sum
Comparison of vector element sum using various data types.
sigurd4/custom_float
Customizable floating point types, with all standard floating point operations implemented from scratch.
puzzlef/pagerank-datatype
Comparison of PageRank algorithm using various datatypes.
StarOne01/bfloat16
A lightweight C++ implementation of the Brain Floating Point (bfloat16) format.
stevechanieee/-1-BFloat16
Hybridized On-Premise and Cloud (HOPC) Deployment Experimentation with Bfloat16