Pinned Repositories
hwloc
Hardware locality (hwloc). This is a fork that includes the extra patches needed by the MPICH project.
MMCompiler
parlooper
PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolutions and Fused Deep Learning Primitives
KavithaTipturMadhu's Repositories
KavithaTipturMadhu/MMCompiler
KavithaTipturMadhu/parlooper
PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolutions and Fused Deep Learning Primitives
KavithaTipturMadhu/hwloc
Hardware locality (hwloc). This is a fork that includes the extra patches needed by the MPICH project.
KavithaTipturMadhu/HypercellCompiler
Hypercell Compiler
KavithaTipturMadhu/incubator-tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
KavithaTipturMadhu/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
KavithaTipturMadhu/mpich
Official MPICH Repository
KavithaTipturMadhu/netloc
Edits to netloc 0.5 release to work with mpich
KavithaTipturMadhu/plaidml
PlaidML is a framework for making deep learning work everywhere.
KavithaTipturMadhu/rcp
KavithaTipturMadhu/RCP-1
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.
KavithaTipturMadhu/tensorflow
An Open Source Machine Learning Framework for Everyone
KavithaTipturMadhu/tensorflowwhl
KavithaTipturMadhu/tpp-sandbox
KavithaTipturMadhu/workstealing