phu0ngng's Stars
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
rockerBOO/awesome-neovim
Collections of awesome neovim plugins.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
federico-busato/Modern-CPP-Programming
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
github/copilot.vim
Neovim plugin for GitHub Copilot
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
ROCm/ROCm
AMD ROCm™ Software - GitHub Home
ROCm/HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
RRZE-HPC/likwid
Performance monitoring and benchmarking suite
cp2k/cp2k
Quantum chemistry and solid state physics software package
codeplaysoftware/computecpp-sdk
Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation
RRZE-HPC/OSACA
Open Source Architecture Code Analyzer
virtual-biohackathons/covid-19-bh20
COVID-19 Biohackathon April 5-11 2020
charmplusplus/charm
The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
haltakov/simple-photo-gallery
Beautiful and simple photo galleries that help you tell your story. Free and open-source.
Azrael3000/tmpi
Run a parallel command inside a split tmux window
amd/blis
BLAS-like Library Instantiation Software Framework
KhronosGroup/SYCL-Docs
SYCL Open Source Specification
davidrohr/hpl-gpu
High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)
TUM-I5/SWE
An Education-Oriented Code for Parallel Tsunami Simulation
hemelb-codes/hemelb
A high performance parallel lattice-Boltzmann code for large scale fluid flow in complex geometries
ROCm-Developer-Tools/llvm-project
This repo is a mirror of upstream https://github.com/llvm/llvm-project . Every three hours the main branch is mirrored from upstream. Please do not create pull requests on main, use branch amd-trunk-dev
stackhpc/ansible-role-beegfs
Create beegfs server and client
RRZE-HPC/MachineState
This CLI tool and Python3 module collects the current system state for documentation
amd/scalapack
DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository under the same "amd" organization for AOCL Scalapack 3.1 release onwards. https://github.com/amd/aocl-scalapack
SC-Tech-Program/SCreproducibility
SC Reproducibility Initiative
argonne-lcf/SyclCPLX
Sycl complex library header-only
phu0ngng/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.