Tianhao-Qi's Stars
binary-husky/gpt_academic
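Provides a practical interactive interface for GPT/GLM and other large language models, especially optimized for paper reading, polishing, and writing. Modular design, with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot (Wenxin Yiyan), llama2, rwkv, claude2, moss, and others.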
harry0703/MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models.
lllyasviel/Omost
Your image is almost there!
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Kwai-Kolors/Kolors
Kolors Team
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
LLaVA-VL/LLaVA-NeXT
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
chflame163/ComfyUI_LayerStyle
A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.
twri/sdxl_prompt_styler
Custom prompt styler node for SDXL in ComfyUI
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
sled-group/InfEdit
[CVPR 2024] Official implementation of the paper "Inversion-Free Image Editing with Natural Language"
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
AILab-CVC/CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
I2V-Adapter/I2V-Adapter-repo
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models
ali-videoai/Tora
Official repo for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
xinntao/HandyFigure
HandyFigure provides the source files (usually PPT files) for paper figures
guoqincode/DiT-Visualization
Visualization of DiT self attention features
Akaneqwq/360DVD
[CVPR2024] 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Monalissaa/DisenDiff
[CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
zhenglinpan/Awesome-Animation-Research
Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.
bytedance/Portrait-Mode-Video
Video dataset dedicated to portrait-mode video recognition.
Hritikbansal/talc
xuyang-liu16/VGDiffZero
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
FaltingsA/SSM
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition