blas
There are 583 repositories under the blas topic.
OpenMathLib/OpenBLAS
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
flame/blis
BLAS-like Library Instantiation Software Framework
eigenteam/eigen-git-mirror
THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen
Reference-LAPACK/lapack
LAPACK development repository
trholding/llama2.c
Llama 2 Everywhere (L2E)
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for AVX2, AVX-512, NEON, SVE, & SVE2 📐
fortran-lang/stdlib
Fortran Standard Library
CNugteren/CLBlast
Tuned OpenCL BLAS
lebedov/scikit-cuda
Python interface to GPU-powered libraries
mateogianolio/vectorious
Linear algebra in TypeScript.
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
uxlfoundation/oneMath
oneAPI Math Library (oneMath)
100/Cranium
🤖 A portable, header-only, artificial neural network library written in C99
lessthanoptimal/ejml
A fast and easy to use linear algebra library written in Java for dense, sparse, real, and complex matrices.
conradsnicta/armadillo-code
Armadillo: fast C++ library for linear algebra & scientific computing - https://arma.sourceforge.net
optimatika/ojAlgo
oj! Algorithms
Foadsf/Cmathtuts
A collection of useful tutorials for well-known C math and linear algebra libraries such as CBLAS, CLAPACK, GSL...
ROCm/rocBLAS
[DEPRECATED] Moved to ROCm/rocm-libraries repo
giaf/blasfeo
Basic linear algebra subroutines for embedded optimization
kokkos/kokkos-kernels
Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
coderonion/awesome-cuda-and-hpc
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
R-js/blasjs
Pure JavaScript, manually written 👌 implementation of BLAS. Many numerical software applications use BLAS computations, including Armadillo, LAPACK, LINPACK, GNU Octave, Mathematica, MATLAB, NumPy, R, and Julia.
mratsim/laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
ROCm/Tensile
[DEPRECATED] Moved to ROCm/rocm-libraries repo
explosion/cython-blis
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
libmir/mir
Mir (backports): Sparse tensors, Hoffman
ricosjp/monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
SciFortran/SciFortran
An open-source Fortran library for mathematics, science and engineering (*in a way* just like scipy for python)
xtensor-stack/xtensor-blas
BLAS extension to xtensor
james-bowman/sparse
Sparse matrix formats for linear algebra supporting scientific and machine learning applications
yzhaiustc/Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F
Stepwise optimizations of DGEMM on CPU, eventually outperforming Intel MKL, even under multithreading.
ROCm/hipBLAS
[DEPRECATED] Moved to ROCm/rocm-libraries repo
cp2k/dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
sevagh/demucs.cpp
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
mmottl/lacaml
OCaml bindings for BLAS/LAPACK (high-performance linear algebra Fortran libraries)
yui0/slibs
Single file libraries for C/C++