Pinned Repositories
Grendel
Code for generating molecular force fields in internal coordinates
legate-hello-world
A hello world for Legate programs
PySkyNet
sprockit
C++ toolkit for parameter reading, serialization, output, etc
kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
kokkos-kernels
Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
sst-core
SST Structural Simulation Toolkit Parallel Discrete Event Core and Services
sst-macro
SST Macro Element Library
Trilinos
Primary repository for the Trilinos Project
jjwilke's Repositories
jjwilke/legate-hello-world
A hello world for Legate programs
jjwilke/sst-macro
SST Macro Element Library
jjwilke/sst-transports
A set of communication transports running in SST rather than real systems
jjwilke/cunumeric
An Aspiring Drop-In Replacement for NumPy at Scale
jjwilke/ember
Ember Communication Patterns
jjwilke/gsites-research-highlights
jjwilke/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
jjwilke/jaxite
jjwilke/kokkos
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
jjwilke/kokkos-kernels
Kokkos C++ Performance Portability Programming EcoSystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
jjwilke/kokkos-mpi-test
A test to make sure MPI + Kokkos wrappers are working
jjwilke/kokkos-nvcc-wrapper
The NVCC wrapper used by Kokkos
jjwilke/kokkos-spack
kokkos-spack
jjwilke/kokkos-tutorials
Tutorials for the Kokkos C++ Performance Portability Programming EcoSystem
jjwilke/legate.core
The Foundation for All Legate Libraries
jjwilke/lernen
jjwilke/libfabric
Open Fabric Interfaces
jjwilke/nccl
Optimized primitives for collective multi-GPU communication
jjwilke/paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
jjwilke/praxis
jjwilke/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
jjwilke/sst-core
SST Structural Simulation Toolkit Parallel Discrete Event Core and Services
jjwilke/sst-elements
SST Architectural Simulation Components and Libraries
jjwilke/sst-tpls
SST Third Party Libraries
jjwilke/sst-ugni
An implementation of uGNI for the Structural Simulation Toolkit (SST)
jjwilke/tensorflow
An Open Source Machine Learning Framework for Everyone
jjwilke/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
jjwilke/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
jjwilke/Trilinos
Primary repository for the Trilinos Project
jjwilke/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators