Pinned Repositories
amd-rocm-codelabs
Codelabs to help you get started with portable GPU acceleration
amd-rocm-codelabs_example-codes
darknet
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
hackathon-docs
ocean-rcc
VM image builds for the OS HPC cluster
p3dfft.3
P3DFFT++ (a.k.a. P3DFFT v. 3) is a new generation of P3DFFT library that aims to provide a comprehensive framework for simulating multiscale phenomena. It takes the essence of P3DFFT further by creating an extensible, modular structure uniquely adaptable to a greater range of use cases. The users can specify in detail what kind of data layout they would like to use, both in terms of local memory ordering and the processor layout. Just like P3DFFT, P3DFFT++ is a distributed software package, using MPI as the primary method for interprocessor commubnication. It supports 1D, 2D and 3D (to come soon) domain decomposition schemes. As P3DFFT, P3DFFT++ also relies on lower-level libraries, for example FFTW to perform optimized 1D FFTs. Unlike P3DFFT, which was written in Fortran90, P3DFFT++ is written in C++. Interfaces are provided for C and Fortran. To learn about using the code the user is encouraged to study example programs in C++, C and FORTRAN subdirectories. Please e-mail Dmitry Pekurovsky (dmitry@sdsc.edu) for any questions or suggestions. Software contributions are welcome, assuming they follow the main ideas of the framework.
porTTgpu
PORtable Tensor Transpose for gpu's
OS Hackathon's Repositories
os-hackathon/darknet
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
os-hackathon/amd-rocm-codelabs
Codelabs to help you get started with portable GPU acceleration
os-hackathon/amd-rocm-codelabs_example-codes
os-hackathon/hackathon-docs
os-hackathon/ocean-rcc
VM image builds for the OS HPC cluster
os-hackathon/p3dfft.3
P3DFFT++ (a.k.a. P3DFFT v. 3) is a new generation of P3DFFT library that aims to provide a comprehensive framework for simulating multiscale phenomena. It takes the essence of P3DFFT further by creating an extensible, modular structure uniquely adaptable to a greater range of use cases. The users can specify in detail what kind of data layout they would like to use, both in terms of local memory ordering and the processor layout. Just like P3DFFT, P3DFFT++ is a distributed software package, using MPI as the primary method for interprocessor commubnication. It supports 1D, 2D and 3D (to come soon) domain decomposition schemes. As P3DFFT, P3DFFT++ also relies on lower-level libraries, for example FFTW to perform optimized 1D FFTs. Unlike P3DFFT, which was written in Fortran90, P3DFFT++ is written in C++. Interfaces are provided for C and Fortran. To learn about using the code the user is encouraged to study example programs in C++, C and FORTRAN subdirectories. Please e-mail Dmitry Pekurovsky (dmitry@sdsc.edu) for any questions or suggestions. Software contributions are welcome, assuming they follow the main ideas of the framework.
os-hackathon/porTTgpu
PORtable Tensor Transpose for gpu's