alanzhai219's Stars
dandavison/delta
A syntax-highlighting pager for git, diff, grep, and blame output
gabime/spdlog
Fast C++ logging library.
spmallick/learnopencv
Learn OpenCV: C++ and Python Examples
ml-explore/mlx
MLX: An array framework for Apple silicon
AstroNvim/AstroNvim
AstroNvim is an aesthetic and feature-rich Neovim config that is extensible and easy to use with a great set of plugins
Alinshans/MyTinySTL
Achieve a tiny STL in C++11
cyrus-and/gdb-dashboard
Modular visual interface for GDB in Python
wolfpld/tracy
Frame profiler
gaogaotiantian/viztracer
A debugging and profiling tool that can trace and visualize Python code execution
barry-far/V2ray-Configs
🛰️✨ Free V2ray configs, updated every 10 minutes.
DeepGraphLearning/LiteratureDL4Graph
A comprehensive collection of recent papers on graph deep learning
intel/pcm
Intel® Performance Counter Monitor (Intel® PCM)
Qihoo360/safe-rules
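A detailed C/C++ coding standards guide compiled by the 360 Quality Engineering Department, applicable to desktop, server, and embedded software systems.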
andikleen/pmu-tools
Intel PMU profiling tools
AdaptiveCpp/AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
tensor-compiler/taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
p12tic/libsimdpp
Portable header-only C++ low level SIMD library
getnf/getnf
A better way to install Nerd Fonts
ekondis/mixbench
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
kokkos/kokkos-tutorials
Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem
NVIDIA/cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
zjhellofss/KuiperLLama
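A strong project for campus recruiting (autumn and spring hiring rounds) and internships: build, from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.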
codeplaysoftware/visioncpp
A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms
boostorg/core
Boost Core Utilities
nicolaswilde/cuda-tensorcore-hgemm
akhin/metamalloc
Malloc as a single-header library for Linux and Windows. Can also be used for local allocations. The repo also provides a live per-thread HTTP memory profiler as a separate single-header with no dependencies.
chsasank/device-benchmarks
Benchmarks of different devices I have come across
p-anastas/PARALiA-Framework
-
triSYCL/sycl-sc
SYCL SC wrapper on top of SYCL to experiment with possible SYCL SC concepts