Pinned Repositories
starcoder
Home of StarCoder: fine-tuning & inference!
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
RCRnorm
starcoder
Home of StarCoder: fine-tuning & inference!
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
jiagaoxiang's Repositories
jiagaoxiang/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
jiagaoxiang/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
jiagaoxiang/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
jiagaoxiang/RCRnorm
jiagaoxiang/starcoder
Home of StarCoder: fine-tuning & inference!