Pinned Repositories
MS-AMP
Microsoft Automatic Mixed Precision Library
antares
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore and Intel-OneAPI backends.
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.
superbenchmark
Provides hardware and software benchmarks for AI systems.
TensorFlow.Xamarin.Android
Xamarin bindings for the TensorFlow Android Inference library
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
guoshzhao's Repositories
guoshzhao/antares
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore and Intel-OneAPI backends.
guoshzhao/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
guoshzhao/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.
guoshzhao/superbenchmark
Provides hardware and software benchmarks for AI systems.
guoshzhao/TensorFlow.Xamarin.Android
Xamarin bindings for the TensorFlow Android Inference library
guoshzhao/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
guoshzhao/tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation