Tianhao-Qi

CV, PhD student@USTC

USTC

Tianhao-Qi's Stars

binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python65.8k 277 1.6k8.1k
harry0703/MoneyPrinterTurbo
利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.
Language:Python18.2k 145 3952.8k
lllyasviel/Omost
Your image is almost there!
Language:Python7.3k 45 81421
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.1k 57 649514
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language:Jupyter Notebook6k 86 146598
Kwai-Kolors/Kolors
Kolors Team
Language:Python3.9k 39 134273
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python3.5k 43 176296
LLaVA-VL/LLaVA-NeXT
Language:Python2.9k 33 302250
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Language:Python2.3k 39 51281
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Language:Jupyter Notebook1.7k 25 5299
chflame163/ComfyUI_LayerStyle
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
Language:Python1.5k 11 37886
twri/sdxl_prompt_styler
Custom prompt styler node for SDXL in ComfyUI
Language:Python759 9 2875
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
Language:Python747 15 2858
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Language:Python579 11 7228
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language:Python490 5 1924
sled-group/InfEdit
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
Language:Python289 5 198
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
Language:Python253 13 408
AILab-CVC/CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Language:Jupyter Notebook243 14 168
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Language:Python212 7 1614
I2V-Adapter/I2V-Adapter-repo
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models
201 23 64
ali-videoai/Tora
Official repo for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
1676
xinntao/HandyFigure
HandyFigure provides the sources file (ususally PPT files) for paper figures
Language:JavaScript162 7 014
guoqincode/DiT-Visualization
Visualization of DiT self attention features
Language:Python157 12 111
Akaneqwq/360DVD
[CVPR2024] 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Language:Python121 3 86
Monalissaa/DisenDiff
[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
Language:Python84 3 62
zhenglinpan/Awesome-Animation-Research
Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.
82 13 18
bytedance/Portrait-Mode-Video
Video dataset dedicated to portrait-mode video recognition.
Language:Python37 4 41
Hritikbansal/talc
Language:Python22 2 01
xuyang-liu16/VGDiffZero
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
Language:Python10 3 00
FaltingsA/SSM
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
8 2 10

Tianhao-Qi

Tianhao-Qi's Stars

binary-husky/gpt_academic

harry0703/MoneyPrinterTurbo

lllyasviel/Omost

sgl-project/sglang

HVision-NKU/StoryDiffusion

Kwai-Kolors/Kolors

Tencent/HunyuanDiT

LLaVA-VL/LLaVA-NeXT

tencent-ailab/V-Express

YangLing0818/RPG-DiffusionMaster

chflame163/ComfyUI_LayerStyle

twri/sdxl_prompt_styler

TencentARC/SEED-Story

Vchitect/VBench

TianxingWu/FreeInit

sled-group/InfEdit

TencentARC/SmartEdit

AILab-CVC/CV-VAE

mihirp1998/VADER

I2V-Adapter/I2V-Adapter-repo

ali-videoai/Tora

xinntao/HandyFigure

guoqincode/DiT-Visualization

Akaneqwq/360DVD

Monalissaa/DisenDiff

zhenglinpan/Awesome-Animation-Research

bytedance/Portrait-Mode-Video

Hritikbansal/talc

xuyang-liu16/VGDiffZero

FaltingsA/SSM