PanXiebit

Beihang UniversityBeijing

PanXiebit's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python14.2k 125 1301k
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python2.7k 93 86248
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！
Language:Python2.6k 18 175163
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language:Jupyter Notebook2.4k 16 83523
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2k 27 163132
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k 26 46108
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language:Python1.2k 17 8588
jeanfeydy/geomloss
Geometric loss functions between point clouds, images and volumes
Language:Python586 13 7257
catcathh/UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
Language:Python52118
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Language:Python512 6 2020
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language:Python467 6 2419
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language:Python403 17 1922
G-U-N/Phased-Consistency-Model
Boosting the performance of consistency models with PCM!
Language:Python338 21 1711
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
Language:Python321 7 1125
NVlabs/I2SB
Language:Python247 5 1422
aim-uofa/MovieDreamer
239 21 39
Huage001/LinFusion
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
Language:Python20512
RLHF-V/RLAIF-V
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Language:Python202 4 276
locuslab/ect
Consistency Models Made Easy
Language:Python196 6 137
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Language:Python162 7 1414
Yuanshi9815/Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
Language:Python158 1 1015
meder411/PyTorch-EMDLoss
PyTorch 1.0 implementation of the approximate Earth Mover's Distance
Language:Cuda134 5 1013
thu-ml/Bridge-TTS
Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).
120 39 41
chen-wl20/DreamCinema
DreamCinema: Cinematic Transfer with Free Camera and 3D Character
791
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language:Python79 5 115
LituRout/OptimalTransportModeling
The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.
Language:Jupyter Notebook54 2 29
XJTU-XGU/OTCS
Code for "Optimal Transport-Guided Conditional Score-Based Diffusion Model (NeurIPS, 8,7,7,6)"
Language:Jupyter Notebook46 1 02
AI-Study-Han/Zero-Qwen-VL
训练一个对中文支持更好的LLaVA模型，并开源训练代码和数据。
Language:Python242
QUVA-Lab/SIGMA
Language:Python9 8 00
SignDiff/Processed-Data
Preprocessed data of SignDiff: Learning Diffusion Models for American Sign Language Production
Language:Python8 2 31

PanXiebit

PanXiebit's Stars

black-forest-labs/flux

gpt-omni/mini-omni

modelscope/data-juicer

boyu-ai/Hands-on-RL

THUDM/CogVLM2

facebookresearch/chameleon

aigc-apps/EasyAnimate

jeanfeydy/geomloss

catcathh/UltraPixel

buoyancy99/diffusion-forcing

Alpha-VLLM/Lumina-mGPT

Vchitect/VEnhancer

G-U-N/Phased-Consistency-Model

jannerm/ddpo

NVlabs/I2SB

aim-uofa/MovieDreamer

Huage001/LinFusion

RLHF-V/RLAIF-V

locuslab/ect

yk7333/d3po

Yuanshi9815/Video-Infinity

meder411/PyTorch-EMDLoss

thu-ml/Bridge-TTS

chen-wl20/DreamCinema

ZebangCheng/Emotion-LLaMA

LituRout/OptimalTransportModeling

XJTU-XGU/OTCS

AI-Study-Han/Zero-Qwen-VL

QUVA-Lab/SIGMA

SignDiff/Processed-Data