blas
There are 419 repositories under the blas topic.
OpenMathLib/OpenBLAS
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
flame/blis
BLAS-like Library Instantiation Software Framework
eigenteam/eigen-git-mirror
THIS MIRROR IS DEPRECATED -- New url: https://gitlab.com/libeigen/eigen
Reference-LAPACK/lapack
LAPACK development repository
trholding/llama2.c
Llama 2 Everywhere (L2E)
CNugteren/CLBlast
Tuned OpenCL BLAS
fortran-lang/stdlib
Fortran Standard Library
lebedov/scikit-cuda
Python interface to GPU-powered libraries
mateogianolio/vectorious
Linear algebra in TypeScript.
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
ashvardanian/SimSIMD
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
oneapi-src/oneMKL
oneAPI Math Kernel Library (oneMKL) Interfaces
100/Cranium
🤖 A portable, header-only, artificial neural network library written in C99
lessthanoptimal/ejml
A fast and easy to use linear algebra library written in Java for dense, sparse, real, and complex matrices.
conradsnicta/armadillo-code
Armadillo: fast C++ library for linear algebra & scientific computing - https://arma.sourceforge.net
optimatika/ojAlgo
oj! Algorithms
Foadsf/Cmathtuts
An attempt to collect useful tutorials for well-known C math and linear algebra libraries such as CBLAS, CLAPACK, GSL...
ROCm/rocBLAS
Next generation BLAS implementation for ROCm platform
R-js/blasjs
Pure JavaScript, manually written 👌 implementation of BLAS. Many numerical software applications use BLAS computations, including Armadillo, LAPACK, LINPACK, GNU Octave, Mathematica, MATLAB, NumPy, R, and Julia.
kokkos/kokkos-kernels
Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
giaf/blasfeo
Basic linear algebra subroutines for embedded optimization
mratsim/laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
explosion/cython-blis
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
libmir/mir
Mir (backports): Sparse tensors, Hoffman
ROCm/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
ricosjp/monolish
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
aamaricci/SciFortran
A library of Fortran modules and routines for scientific calculations (*in a way* just like SciPy for Python)
james-bowman/sparse
Sparse matrix formats for linear algebra supporting scientific and machine learning applications
xtensor-stack/xtensor-blas
BLAS extension to xtensor
cp2k/dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
mmottl/lacaml
OCaml bindings for BLAS/LAPACK (high-performance linear algebra Fortran libraries)
MasonProtter/Gaius.jl
Divide and Conquer Linear Algebra
calebzulawski/cotila
A compile-time linear algebra system for C++
yui0/slibs
Single file libraries for C/C++
ROCm/hipBLAS
ROCm BLAS marshalling library
codingonion/awesome-cuda-tensorrt-fpga
🔥🔥🔥 A collection of some awesome public NVIDIA CUDA, cuBLAS, cuDNN, TensorRT, AMD ROCm and FPGA projects.