andresrubiop's Stars
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
GMvandeVen/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
EdenBelouadah/class-incremental-learning
rasbt/deeplearning-models
A collection of various deep learning architectures, models, and tips
ParCoreLab/ReuseTracker
A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.
boegel/MICA
A Pin tool for collecting microarchitecture-independent workload characteristics
numamma/numamma
NumaMMA is a lightweight memory profiler for parallel applications
intel/pcm
Intel® Performance Counter Monitor (Intel® PCM)
pavlosaim/mapvisual
Software for analysis and visualization of Memory Access Pattern (MAP).
google/multichase
intel/numatop
NumaTOP is an observation tool for runtime memory locality characterization and analysis of processes and threads running on a NUMA system.
GMAP/NPB-CPP
The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures
intel/ipmctl
memtt/numaprof
NUMAPROF is a NUMA memory profiler based on Pintool to track your remote memory accesses.
memtt/malt
MALT is a MALloc Tracker to find where and how you made your memory allocations in C/C++/Fortran applications.
heechul/memguard
Memory Bandwidth Reservation System for Efficient Performance Isolation in Multi-core Processors
intel/intel-cmt-cat
User space software for Intel(R) Resource Director Technology
pmem/ndctl
A "device memory" enabling project encompassing tools and libraries for CXL, NVDIMMs, DAX, memory tiering and other platform memory device topics.
uart/gltracesim
A graphics tracing and replay framework to explore system-level effects on heterogeneous CPU+GPU memory systems.
fraghag/pirate
Implementation of a Cache Pirate. Made for the UART team in Uppsala University.
open-mpi/hwloc
Hardware locality (hwloc)
RRZE-HPC/likwid
Performance monitoring and benchmarking suite
NicolasDenoyelle/dynamic_lstopo
Performance analysis of parallel applications' placement on hierarchical architectures
NicolasDenoyelle/Locality-Aware-Roofline-Model
Instantiate the Cache Aware Roofline Model on single-socket and multi-socket systems.
NicolasDenoyelle/Hierarchical-monitors
Match (hardware or software) events with topology
LLNL/Caliper
Caliper is an instrumentation and performance profiling library
memkind/memkind
Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM
ECP-VeloC/VELOC
Very-Low Overhead Checkpointing System
FPBench/FPBench
A standard for floating point accuracy benchmarks
edf-hpc/verrou
Floating-point error checker