Pinned Repositories
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnx
Open standard for machine learning interoperability
Amuse
.NET application for Stable Diffusion. Leveraging OnnxStack, Amuse seamlessly integrates many Stable Diffusion capabilities within the .NET ecosystem.
bert
TensorFlow code and pre-trained models for BERT
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
diffusers
🤗 Diffusers: experiments with diffusion ONNX models
Faster-Diffusion
Stable-Diffusion-WebUI-OnnxRuntime
Extension for Automatic1111's Stable Diffusion WebUI that uses the ONNX Runtime CUDA execution provider to deliver high-performance results on NVIDIA GPUs.
tianleiwu's Repositories
tianleiwu/Amuse
.NET application for Stable Diffusion. Leveraging OnnxStack, Amuse seamlessly integrates many Stable Diffusion capabilities within the .NET ecosystem.
tianleiwu/Stable-Diffusion-WebUI-OnnxRuntime
Extension for Automatic1111's Stable Diffusion WebUI that uses the ONNX Runtime CUDA execution provider to deliver high-performance results on NVIDIA GPUs.
tianleiwu/bert
TensorFlow code and pre-trained models for BERT
tianleiwu/ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
tianleiwu/CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
tianleiwu/DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
tianleiwu/diffusers
🤗 Diffusers: experiments with diffusion ONNX models
tianleiwu/Faster-Diffusion
tianleiwu/gdrivedl
Google Drive Download Python Script
tianleiwu/inference
Reference implementations of inference benchmarks
tianleiwu/onnx
Open Neural Network Exchange
tianleiwu/segment-anything
ONNX Runtime support for SAM
tianleiwu/TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
tianleiwu/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
tianleiwu/tutorials
Tutorials for creating and using ONNX models
tianleiwu/libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
tianleiwu/onnx-modifier
A tool to modify ONNX models visually, based on Netron and Flask.
tianleiwu/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
tianleiwu/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
tianleiwu/OrtMultiThreadCSharp
Test ORT with multithreading
tianleiwu/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
tianleiwu/unsloth
2-5X faster, 70% less memory QLoRA & LoRA fine-tuning