Pinned Repositories
BPPSA-open
The open-source portion of the code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".
CSCD70
CSCD70 Compiler Optimization
DietCode
DietCode Code Release
GPU-Virtualization-Benchmarks
Grape-MICRO56-Artifact
This repository contains the source code for Grape.
hfta
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
hotline
Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
rlscope
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
Tempo
Memory footprint reduction for transformer models
UofT-EcoSystem's Repositories
UofT-EcoSystem/CSCD70
CSCD70 Compiler Optimization
UofT-EcoSystem/Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
UofT-EcoSystem/DietCode
DietCode Code Release
UofT-EcoSystem/rlscope
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
UofT-EcoSystem/hfta
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
UofT-EcoSystem/hotline
UofT-EcoSystem/BPPSA-open
The open-source portion of the code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".
UofT-EcoSystem/Tempo
Memory footprint reduction for transformer models
UofT-EcoSystem/Grape-MICRO56-Artifact
This repository contains the source code for Grape.
UofT-EcoSystem/GPU-Virtualization-Benchmarks
UofT-EcoSystem/MXNet-GPU_Memory_Profiler
Benchmarking using MXNet GPU Memory Profiler
UofT-EcoSystem/MoIL
MoIL: Enabling Efficient Incremental Training on Edge Devices
UofT-EcoSystem/skyline
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
UofT-EcoSystem/incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
UofT-EcoSystem/algorithmic-efficiency
UofT-EcoSystem/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
UofT-EcoSystem/brax
Massively parallel rigid-body physics simulation on accelerator hardware.
UofT-EcoSystem/cache-trace
A collection of Twitter's anonymized production cache traces.
UofT-EcoSystem/eco_ldap
UofT-EcoSystem/habitat-cu116
UofT-EcoSystem/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
UofT-EcoSystem/rlscope_agents
Fork of https://github.com/tensorflow/agents with RL-Scope annotations added.
UofT-EcoSystem/rlscope_mlperf_training
Fork of https://github.com/mlperf/training with RL-Scope annotations added.
UofT-EcoSystem/rlscope_ReAgent
Fork of https://github.com/facebookresearch/ReAgent with RL-Scope annotations added.
UofT-EcoSystem/rlscope_rl-baselines-zoo
Fork of https://github.com/araffin/rl-baselines-zoo with RL-Scope annotations added.
UofT-EcoSystem/rlscope_stable-baselines
Fork of https://github.com/hill-a/stable-baselines with RL-Scope annotations added.
UofT-EcoSystem/Sylva
UofT-EcoSystem/TensorComprehensions
A domain specific language to express machine learning workloads.
UofT-EcoSystem/tensorflow
An Open Source Machine Learning Framework for Everyone
UofT-EcoSystem/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators