hpc
There are 2135 repositories under hpc topic.
JuliaLang/julia
The Julia Programming Language
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
arrayfire/arrayfire
ArrayFire: a general purpose GPU library.
spack/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
volcano-sh/volcano
A Cloud Native Batch System (Project under CNCF)
ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
nextflow-io/nextflow
A DSL for data-driven computational pipelines
apptainer/singularity
Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a snapshot right before the changes.
diku-dk/futhark
:boom::computer::boom: A data-parallel functional programming language
flame/blis
BLAS-like Library Instantiation Software Framework
open-mpi/ompi
Open MPI main development repository
chapel-lang/chapel
a Productive Parallel Programming Language
mfem/mfem
Lightweight, general, scalable C++ library for finite element methods
zml/zml
High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
boostorg/compute
A C++ GPU Computing Library for OpenCL
DTolm/VkFFT
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
AdaptiveCpp/AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
indigo-dc/udocker
A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
su2code/SU2
SU2: An Open-Source Suite for Multiphysics Simulation and Design
NVIDIA/cccl
CUDA Core Compute Libraries
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
trilinos/Trilinos
Primary repository for the Trilinos Project
openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
apptainer/apptainer
Apptainer: Application containers for Linux
kubernetes-retired/kube-batch
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
broadinstitute/cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
gunrock/gunrock
Programmable CUDA/C++ GPU Graph Analytics
jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
HenrikBengtsson/future
:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone
RaftLib/RaftLib
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
openhpc/ohpc
OpenHPC Integration, Packaging, and Test Repo
cp2k/cp2k
Quantum chemistry and solid state physics software package
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
arrayfire/arrayfire-rust
Rust wrapper for ArrayFire
AgnostiqHQ/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
chrisvoncsefalvay/learn-julia-the-hard-way
Learn Julia the hard way!