Pinned Repositories
ai-matrix
To make it easy to benchmark AI accelerators
AITemplate-1
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Deep-Learning-for-Recommendation-Systems
This repository contains Deep Learning based articles, papers, and repositories for Recommender Systems
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
inference
Reference implementations of inference benchmarks
models
The ONNX Model Zoo is a collection of pre-trained, state-of-the-art models in the ONNX format
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TensorRT
PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
frank-wei's Repositories
frank-wei/ai-matrix
To make it easy to benchmark AI accelerators
frank-wei/AITemplate-1
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
frank-wei/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
frank-wei/Deep-Learning-for-Recommendation-Systems
This repository contains Deep Learning based articles, papers, and repositories for Recommender Systems
frank-wei/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
frank-wei/inference
Reference implementations of inference benchmarks
frank-wei/models
The ONNX Model Zoo is a collection of pre-trained, state-of-the-art models in the ONNX format
frank-wei/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
frank-wei/TensorRT
PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
frank-wei/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
frank-wei/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
frank-wei/triton-transformer
Implementation of a Transformer, but completely in Triton