GD06

Ph.D. student in SEAL Lab, Dept. of Electrical and Computer Engineering at UC, Santa Barbara. Research interests include the computer system and architecture.

UC, Santa Barbarahttps://seal.ece.ucsb.edu/location

Pinned Repositories

caffe
Caffe: a fast open framework for deep learning.
Language:C++0 2 00
caffe-tensorflow
Caffe models in TensorFlow
Language:Python0 2 00
cublas_perf
Testing the performance of the cuBLAS
Language:C++0 2 00
cuda-convnet2
Automatically exported from code.google.com/p/cuda-convnet2
Language:Cuda0 1 150
cudnn-tuning
Codes for auto-tuning cudnn conv forward implementations
Language:Python1 2 00
fathom
Reference workloads for modern deep learning methods.
Language:Python0 2 00
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++0 0 00
mkldnn-perf
Testing the performance of the MKL-DNN
Language:C++1 2 04
MPU-ASPLOS-2021
Source code of MPU simulator and compiler for ASPLOS 2021 submission.
Language:Python3 2 02
mpu-sim_distribution
Language:Python12 2 09

GD06's Repositories

GD06/mpu-sim_distribution
Language:Python12 2 09
GD06/MPU-ASPLOS-2021
Source code of MPU simulator and compiler for ASPLOS 2021 submission.
Language:Python3 2 02
GD06/cudnn-tuning
Codes for auto-tuning cudnn conv forward implementations
Language:Python1 2 00
GD06/mkldnn-perf
Testing the performance of the MKL-DNN
Language:C++1 2 04
GD06/caffe
Caffe: a fast open framework for deep learning.
Language:C++0 2 00
GD06/caffe-tensorflow
Caffe models in TensorFlow
Language:Python0 2 00
GD06/cublas_perf
Testing the performance of the cuBLAS
Language:C++0 2 00
GD06/cuda-convnet2
Automatically exported from code.google.com/p/cuda-convnet2
Language:Cuda0 1 150
GD06/fathom
Reference workloads for modern deep learning methods.
Language:Python0 2 00
GD06/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++0 0 00
GD06/flash-attention
Fast and memory-efficient exact attention
Language:Python0 0
GD06/GD06.github.io
Homepage
Language:HTML2 0
GD06/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
Language:C++1 0
GD06/Halide
a language for fast, portable data-parallel computation
Language:C++2 0
GD06/leveldb
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Language:C++2 0
GD06/models
Models and examples built with TensorFlow
Language:Python2 01
GD06/mpu-homepage
Homepage of the MPU project based on the Cayman theme.
Language:HTML1 0
GD06/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Language:C++2 0
GD06/NiftyRec
NiftyRec is a software toolbox for Tomographic image reconstruction. NiftyRec is written in C and computationally intensive functions have a GPU accelerated version based on NVidia CUDA. NiftyRec includes a Matlab Toolbox and a Python Package that access the low level routines, hiding the complexity of the GPU accelerated algorithms.
Language:C2 0
GD06/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:C++0 0
GD06/pytorch-cifar
95.16% on CIFAR10 with PyTorch
Language:Python2 0
GD06/torchrec
Pytorch domain library for recommendation systems
Language:Python0 0
GD06/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python0 0

GD06

Pinned Repositories

caffe

caffe-tensorflow

cublas_perf

cuda-convnet2

cudnn-tuning

fathom

FBGEMM

mkldnn-perf

MPU-ASPLOS-2021

mpu-sim_distribution

GD06's Repositories

GD06/mpu-sim_distribution

GD06/MPU-ASPLOS-2021

GD06/cudnn-tuning

GD06/mkldnn-perf

GD06/caffe

GD06/caffe-tensorflow

GD06/cublas_perf

GD06/cuda-convnet2

GD06/fathom

GD06/FBGEMM

GD06/flash-attention

GD06/GD06.github.io

GD06/gpgpu-sim_distribution

GD06/Halide

GD06/leveldb

GD06/models

GD06/mpu-homepage

GD06/mxnet

GD06/NiftyRec

GD06/pytorch

GD06/pytorch-cifar

GD06/torchrec

GD06/xformers