Pinned Repositories
AITemplate_public
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
cpuinfo
CPU INFOrmation library (x86/ARM, Linux/Mach/NaCl)
cutlass
CUDA Templates for Linear Algebra Subroutines
dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
onnx
Open Neural Network Exchange
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
hlu1's Repositories
hlu1/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
hlu1/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
hlu1/AITemplate_public
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
hlu1/onnx
Open Neural Network Exchange
hlu1/QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
hlu1/caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
hlu1/cpuinfo
CPU INFOrmation library (x86/ARM, Linux/Mach/NaCl)
hlu1/cutlass
CUDA Templates for Linear Algebra Subroutines
hlu1/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
hlu1/KeepingYouAwake
Prevents your Mac from going to sleep.
hlu1/minGPT
A minimal PyTorch re-implementation of OpenAI GPT (Generative Pretrained Transformer) training
hlu1/models
A repository for storing pre-trained Caffe2 models.
hlu1/TASO
A Tensor Algebra SuperOptimizer for Deep Learning
hlu1/tvm-samples
hlu1/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs