Pinned Repositories
af-ayala.github.io
Cabana
Performance-portable library for particle-based simulations
CabanaMD
Molecular dynamics proxy application based on Cabana
CUDALibrarySamples
CUDA Library Samples
DeepBench
Benchmarking Deep Learning operations on different hardware
heffte
Highly Efficient FFT for Exascale
lammps
Public development project of the LAMMPS MD software package
Linear-time-CUR
Linear-time CUR approximation of BEM matrices
Medium
Numerical
af-ayala's Repositories
af-ayala/heffte
Highly Efficient FFT for Exascale
af-ayala/Numerical
af-ayala/Cabana
Performance-portable library for particle-based simulations
af-ayala/lammps
Public development project of the LAMMPS MD software package
af-ayala/Linear-time-CUR
Linear-time CUR approximation of BEM matrices
af-ayala/CabanaMD
Molecular dynamics proxy application based on Cabana
af-ayala/Medium
af-ayala/MPI-benchmark
af-ayala/af-ayala.github.io
af-ayala/CUDALibrarySamples
CUDA Library Samples
af-ayala/DeepBench
Benchmarking Deep Learning operations on different hardware
af-ayala/fiber
af-ayala/findpeaks
The detection of peaks and valleys in a 1d-vector or 2d-array (image)
af-ayala/GPTune
af-ayala/hipFFT
hipFFT is a FFT marshalling library.
af-ayala/miperf
af-ayala/mkldnn-perf
Testing the performance of the MKL-DNN
af-ayala/MPI_monitor
A little library giving you a live monitoring of MPI programs.
af-ayala/omniperf
Advanced Profiling and Analytics for AMD Hardware
af-ayala/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
af-ayala/p3dfft.3
P3DFFT++ (a.k.a. P3DFFT v. 3) is a new generation of P3DFFT library that aims to provide a comprehensive framework for simulating multiscale phenomena. It takes the essence of P3DFFT further by creating an extensible, modular structure uniquely adaptable to a greater range of use cases. The users can specify in detail what kind of data layout they would like to use, both in terms of local memory ordering and the processor layout. Just like P3DFFT, P3DFFT++ is a distributed software package, using MPI as the primary method for interprocessor commubnication. It supports 1D, 2D and 3D (to come soon) domain decomposition schemes. As P3DFFT, P3DFFT++ also relies on lower-level libraries, for example FFTW to perform optimized 1D FFTs. Unlike P3DFFT, which was written in Fortran90, P3DFFT++ is written in C++. Interfaces are provided for C and Fortran. To learn about using the code the user is encouraged to study example programs in C++, C and FORTRAN subdirectories. Please e-mail Dmitry Pekurovsky (dmitry@sdsc.edu) for any questions or suggestions. Software contributions are welcome, assuming they follow the main ideas of the framework.
af-ayala/P3DFFT_Documentation
A public repository to build Read the Docs documentation for P3DFFT. https://github.com/sdsc/p3dfft.3
af-ayala/QInvaders
Quantum Space Invaders
af-ayala/rccl-tests
RCCL Performance Benchmark Tests
af-ayala/rocFFT
Next generation FFT implementation for ROCm
af-ayala/rookiehpc.github.io
A website covering major HPC technologies, designed to welcome contributions.
af-ayala/tensorflow
An Open Source Machine Learning Framework for Everyone
af-ayala/xbyak
a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header