xinntao's Stars
NanmiCoder/MediaCrawler
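Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler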
TencentARC/PhotoMaker
PhotoMaker
windingwind/zotero-pdf-translate
Translate PDF, EPUB, webpage, metadata, annotations, and notes to the target language. Supports 20+ translation services.
threestudio-project/threestudio
A unified framework for 3D content generation.
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Doubiiu/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
pytorch/torchtitan
A native PyTorch Library for large model training
TencentARC/MotionCtrl
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
TencentARC/BrushNet
The official implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
AILab-CVC/GPT4Tools
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.
MC-E/DragonDiffusion
ICLR 2024 (Spotlight)
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
iejMac/video2dataset
Easily create large video datasets from video URLs
bbaaii/DreamDiffusion
Implementation of “DreamDiffusion: Generating High-Quality Images from Brain EEG Signals”
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
TencentARC/Mix-of-Show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
mira-space/Mira
AILab-CVC/SEED-Bench
(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
kaleido-lab/dolphin
General video interaction platform based on LLMs, including Video ChatGPT
mtli/HTML4Vision
A simple HTML visualization tool for computer vision research
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
mira-space/MiraData
TencentARC/DeSRA
Official codes for DeSRA (ICML 2023)
xt4d/CameraViewer
A lightweight tool for camera pose visualization