Aaron: Compile-time Kernel Adaptation for Multi-DNN Inference Acceleration on Edge GPU [SenSys'22 Best Poster]
Primary LanguageCuda