JerryWei1985

JerryWei1985's Stars

datawhalechina/learn-nlp-with-transformers
We want to create a repo to illustrate the usage of transformers in Chinese
Language:Shell2.4k407
google-deepmind/open_x_embodiment
Language:Jupyter Notebook89762
wdndev/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
Language:HTML4k449
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language:Python3k419
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.8k1k
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Language:Python28.6k3.6k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.8k590
thu-ml/SageAttention
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Language:Cuda59928
lyogavin/airllm
AirLLM 70B inference with a single 4GB GPU
Language:Jupyter Notebook5.4k433
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Language:Jupyter Notebook3k240
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Language:Python3.5k375
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Language:C++5.9k667
siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
Language:Jupyter Notebook1.7k107
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.9k75
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Language:Python75256
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in Pytorch
Language:Python3079
sihyun-yu/REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Language:Python69734
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Language:Python3.1k281
3b1b/manim
Animation engine for explanatory math videos
Language:Python71.5k6.3k
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
1.5k44
TianxingChen/Embodied-AI-Guide
A Chinese-language guide to embodied AI
52827
kyutai-labs/moshi
Language:Python6.9k539
kohya-ss/sd-scripts
Language:Python5.3k881
dvgodoy/PyTorchStepByStep
Official repository of my book: "Deep Learning with PyTorch Step-by-Step: A Beginner's Guide"
Language:Jupyter Notebook898346
VinAIResearch/LFM
Official PyTorch implementation of the paper: Flow Matching in Latent Space
Language:Python2138
gle-bellier/flow-matching
Annotated Flow Matching paper
Language:Jupyter Notebook1425
XLabs-AI/x-flux
Language:Python1.7k121
Linaqruf/kohya-trainer
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
Language:Jupyter Notebook1.9k308
ChaofWang/Awesome-Super-Resolution
Collect super-resolution related papers, data, repositories
2.5k355
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.4k70