ueqri's Stars
jlevy/the-art-of-command-line
Master the command line, in one page
junegunn/fzf
:cherry_blossom: A command-line fuzzy finder
Textualize/rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
sharkdp/hexyl
A command-line hex viewer
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
ROCm/ROCm
AMD ROCm™ Software - GitHub Home
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
SchedMD/slurm
Slurm: A Highly Scalable Workload Manager
NVIDIA/cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
NVIDIA/gpu-monitoring-tools
Tools for monitoring NVIDIA GPUs on Linux
LLNL/zfp
Compressed numerical arrays that support high-speed random access
NVIDIA/jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
PySlurm/pyslurm
Python Interface to Slurm
rackslab/Slurm-web
Open source web dashboard for Slurm HPC clusters
LLNL/blt
A streamlined CMake build system foundation for developing HPC software
openucx/ucc
Unified Collective Communication Library
alibaba/acqdp
Alibaba Cloud - Quantum Development Platform
LLNL/UnifyFS
UnifyFS: A file system for burst buffers
LLNL/shroud
Shroud: generate Fortran and Python wrappers for C and C++ libraries
LLNL/Aluminum
High-performance, GPU-aware communication library
LLNL/camp
Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda
owensgroup/SlabHash
A warp-oriented dynamic hash table for GPUs
LLNL/dataracebench
Data race benchmark suite for evaluating OpenMP correctness tools aimed to detect data races.
LLNL/variorum
Vendor-neutral library for exposing power and performance features across diverse architectures
data61/cuda-fixnum
Extended-precision modular arithmetic library that targets CUDA.
unzvfu/cuda-fixnum
Extended-precision modular arithmetic library that targets CUDA.
LLNL/FPChecker
A dynamic analysis tool to detect floating-point errors in HPC applications.
LLNL/mpibind
Pragmatic, Productive, and Portable Affinity for HPC
heptagonhust/PreciseTimer
A High Precision Timer Library for C++