Pinned Repositories
aggregation
The aggregation repository contains a set of algorithms for grouping vertices of DAGs coming from loop-carried dependencies. For more information, see the Sympiler website.
AlgebraicMultigrid.jl
Algebraic Multigrid in Julia
ALP
Home of ALP/GraphBLAS, featuring shared- and distributed-memory auto-parallelisation of linear algebraically formulated graph algorithms and other programs. Soon with more to come!
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
HDagg-benchmark
HDagg source code used for the evaluation in the paper "HDagg: Hybrid Aggregation of Loop-carried Dependence Iterations in Sparse Matrix Computations."
learning-chip's Repositories
learning-chip/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
learning-chip/HDagg-benchmark
HDagg source code used for the evaluation in the paper "HDagg: Hybrid Aggregation of Loop-carried Dependence Iterations in Sparse Matrix Computations."
learning-chip/aggregation
The aggregation repository contains a set of algorithms for grouping vertices of DAGs coming from loop-carried dependencies. For more information, see the Sympiler website.
learning-chip/AlgebraicMultigrid.jl
Algebraic Multigrid in Julia
learning-chip/ALP
Home of ALP/GraphBLAS, featuring shared- and distributed-memory auto-parallelisation of linear algebraically formulated graph algorithms and other programs. Soon with more to come!
learning-chip/BootCMatch
Bootstrap AMG based on Compatible weighted Matching
learning-chip/BootCMatchG
learning-chip/dmg
Source code for the Deep Multigrid method: https://arxiv.org/pdf/1711.03825.pdf
learning-chip/ginkgo
Numerical linear algebra software package
learning-chip/helmnet
Deep-learning iterative solver for the heterogeneous 2D Helmholtz equation
learning-chip/libflame
High-performance object-based library for DLA computations
learning-chip/llama.cpp
LLM inference in C/C++
learning-chip/LookaheadDecoding
learning-chip/modern-cpp-template
A template for modern C++ projects using CMake, Clang-Format, CI, unit testing and more, with support for downstream inclusion.
learning-chip/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
learning-chip/neuralBPX
Direct optimisation of BPX preconditioners
learning-chip/npbench
NPBench - A Benchmarking Suite for High-Performance NumPy
learning-chip/PainlessInferenceAcceleration
learning-chip/pargemslr
parGeMSLR is an MPI-based package for solving and preconditioning sparse linear systems, implemented in C++.
learning-chip/partially-strided-codelet
learning-chip/probabilistic-programming
Notebooks for probabilistic programming
learning-chip/rl_grid_coarsen
learning-chip/SpLLT
Sparse Cholesky solver implemented with a runtime system
learning-chip/spral
Sparse Parallel Robust Algorithms Library
learning-chip/SymbolicLib
learning-chip/sympiler
Sympiler is a Code Generator for Transforming Sparse Matrix Codes
learning-chip/sympiler-eigen
An Eigen Interface for Sympiler and NASOQ
learning-chip/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
learning-chip/treadle
Chisel/Firrtl execution engine
learning-chip/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators