Pinned Repositories
AutoTiering
Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC'21)
bcl
The Berkeley Container Library
CPU-Free-model
https://dl.acm.org/doi/10.1145/3577193.3593713
DeepLearningExamples
Deep Learning Examples
DeepSpeedExamples
Example models using DeepSpeed
DenseNet
Densely Connected Convolutional Networks, In CVPR 2017 (Best Paper Award).
DEVC-17
distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
FpgaNIC
FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs
heterocl
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing
pixiesunky's Repositories
pixiesunky/bcl
The Berkeley Container Library
pixiesunky/AutoTiering
Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC'21)
pixiesunky/CPU-Free-model
https://dl.acm.org/doi/10.1145/3577193.3593713
pixiesunky/DeepLearningExamples
Deep Learning Examples
pixiesunky/DeepSpeedExamples
Example models using DeepSpeed
pixiesunky/DenseNet
Densely Connected Convolutional Networks, In CVPR 2017 (Best Paper Award).
pixiesunky/DEVC-17
pixiesunky/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
pixiesunky/FpgaNIC
FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs
pixiesunky/heterocl
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing
pixiesunky/heterogeneity-aware-lowering-and-optimization
heterogeneity-aware-lowering-and-optimization
pixiesunky/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
pixiesunky/malloc
implementation of malloc with mmap()
pixiesunky/MegEngine
MegEngine is a fast, scalable, easy-to-use numerical computation framework with support for automatic differentiation.
pixiesunky/MGG_OSDI23
MGG-Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Multi-GPU Platforms.
pixiesunky/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
pixiesunky/mtcnn
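A Caffe-based MTCNN training implementation that lets you train your own effective object detection algorithm simply and easily, with a companion pure C++ version, mtcnn-light.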
pixiesunky/multi-clock
Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.
pixiesunky/MVision
Robot vision, mobile robots, VS-SLAM, ORB-SLAM2, deep learning object detection, yolov3, behavior detection, opencv, PCL, machine learning, autonomous driving
pixiesunky/ngraph
nGraph - open source C++ library, compiler and runtime for Deep Learning
pixiesunky/pynetbuilder
pyNetBuilder is a modular Pythonic interface with built-in modules for generating popular Caffe prototxt network file definitions.
pixiesunky/pytorch-classification
Classification with PyTorch.
pixiesunky/pytorch-handbook
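pytorch handbook is an open-source book that aims to help people who want to use, or already use, PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.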
pixiesunky/samples
pixiesunky/SDAccel_Examples
SDAccel Examples
pixiesunky/segmentation_models
Segmentation models with pretrained backbones. Keras and TensorFlow Keras.
pixiesunky/tensorflow
An Open Source Machine Learning Framework for Everyone
pixiesunky/TF2
An Open Source Deep Learning Inference Engine Based on FPGA
pixiesunky/Tiny-DSOD
Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usage
pixiesunky/tpu
Reference models and tools for Cloud TPUs.