Pinned Repositories
arm-sve-benchmarks
Performance comparison between small hand-written SVE kernels and compiler-generated ones.
dotfiles
My current system configuration.
in512-systems-programming
Lab corrections for the 3rd year IN512 - Systems Programming course at the University of Versailles - Saint-Quentin-en-Yvelines (UVSQ)
interpol
Interposition library to trace and profile non-blocking MPI calls.
kokkos-comm
Unofficial MPI Wrapper for Kokkos
lattice-boltzmann-method
Optimization of a LBM using hybrid parallelization (MPI and OpenMP) and agressive intrinsics vectorization.
master-thesis
Master's thesis on Rust and GPU programming at CEA/Paris-Saclay University
Rust-CUDA
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust. This fork adds initial CUDA 12 support.
sve-string-routines-benchmarks
Comparative performance benchmarks for hand-optimized Arm SVE implementations of C standard library string routines.
vec
A fast, generic, contiguous growable array type written in pure C.
dssgabriel's Repositories
dssgabriel/Rust-CUDA
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust. This fork adds initial CUDA 12 support.
dssgabriel/dotfiles
My current system configuration.
dssgabriel/in512-systems-programming
Lab corrections for the 3rd year IN512 - Systems Programming course at the University of Versailles - Saint-Quentin-en-Yvelines (UVSQ)
dssgabriel/master-thesis
Master's thesis on Rust and GPU programming at CEA/Paris-Saclay University
dssgabriel/TOP-24
Labs for the Parallel Optimization Techniques course at Paris-Saclay University
dssgabriel/alfarroba
A colorscheme for night dwellers, based on https://gitlab.com/snakedye/chocolate
dssgabriel/cache-latency
Simple cache latency benchmark using a random pointer chasing loop
dssgabriel/pipelined-memory
dssgabriel/kokkos-comm
Unofficial MPI Wrapper for Kokkos
dssgabriel/sve-string-routines-benchmarks
Comparative performance benchmarks for hand-optimized Arm SVE implementations of C standard library string routines.
dssgabriel/aarch64-ubench
Microbenchmarks for AArch64 FP&SIMD instructions, based on clamchowder's microbenchmarks.
dssgabriel/aoc-23-cpp
Advent of Code 2023 in C++
dssgabriel/aoc-24
Advent of Code 2024 in Zig
dssgabriel/arm-deinterleaving-loads
Benchmarking Arm SIMD de-interleaving loads against scalar instructions.
dssgabriel/cpc
Text calculator with support for units and conversion
dssgabriel/CUDA-image-processing
Simple image processing filters for both CPU and NVIDIA GPUs
dssgabriel/dssgabriel
My personal repo
dssgabriel/eurocc_cfd
CFD code in Rust, C, and Fortran
dssgabriel/fpga-load-store-bandwidth
Load/Store comparison on Intel FPGA using oneAPI
dssgabriel/helix
A post-modern modal text editor.
dssgabriel/internship_gratifications
Outil de calcul du nombre d'heures de travail et de la gratification résultante
dssgabriel/k6071
Exploration repo for KokkosCore issue #6071.
dssgabriel/llama.cpp
Port of Facebook's LLaMA model in C/C++
dssgabriel/Microbenchmarks
Trying to figure various CPU things out
dssgabriel/molecular-simulation
Introduction to Molecular Simulation course project at Paris-Saclay University's HPCS master
dssgabriel/nanobench
Simple, fast, accurate single-header microbenchmarking functionality for C++11/14/17/20
dssgabriel/nccl
Optimized primitives for collective multi-GPU communication
dssgabriel/optimized-routines
Optimized implementations of various library functions for ARM architecture processors
dssgabriel/sampik
Simple API for MPI + Kokkos interop
dssgabriel/startpage
Custom browser startpage