vinesmsuic
You're doing what you love. Isn't that enough?
Pinned Repositories
DreamEdit
Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)
AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
ImagenHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]
TheoremExplainAgent
Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]
VideoGenHub
A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)
Depth-Cameras
How to set up or use any depth camera in ROS (Starter)
ipainter-diffusion
Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)
Pcap-To-Img
Modified version of USTC-TK2016: Toolkit for processing PCAP files and transforming them into image data for training
White-box-Cartoonization-PyTorch
PyTorch implementation of “Learning to Cartoonize Using White-box Cartoon Representations” (CVPR 2020). Now with gradio demo
vinesmsuic's Repositories
vinesmsuic/ipainter-diffusion
Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)
vinesmsuic/ImagenHub
(ICLR2024) Official Code for "ImagenHub: Standardizing the evaluation of conditional image generation models"
vinesmsuic/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
vinesmsuic/flow_matching
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
vinesmsuic/InstantStyle
vinesmsuic/pexels.com-bulk-downloads-videos
Bulk-download videos from pexels.com with this simple Python script.
vinesmsuic/TeamFortress2-Shader
A toy rasterization + ray-tracing renderer. Implements the shader of the game TF2 in C++ only, no OpenGL
vinesmsuic/vinesmsuic
Profile
vinesmsuic/AnyV2V
Perform Video Editing with only one image. Now with gradio demo
vinesmsuic/awesome-tips
vinesmsuic/BIVDiff
[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
vinesmsuic/clipseg
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
vinesmsuic/ControlNet
Let us control diffusion models
vinesmsuic/diffengine
Diffusers training with mmengine
vinesmsuic/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
vinesmsuic/DreamDistribution
vinesmsuic/GPU-Puzzles
Solve puzzles. Learn CUDA.
vinesmsuic/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
vinesmsuic/inspiration_tree
vinesmsuic/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
vinesmsuic/lang-segment-anything
SAM with text prompt
vinesmsuic/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
vinesmsuic/my-github-stats
vinesmsuic/Paints-UNDO
Understand Human Behavior to Align True Needs
vinesmsuic/SRC
SRC# (Simulation RPG Construction Sharp), a C# .NET port of SRC (Simulation RPG Construction)
vinesmsuic/style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
vinesmsuic/supervision
We write your reusable computer vision tools. 💜
vinesmsuic/VideoGenHub
A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
vinesmsuic/Visual-Style-Prompting
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
vinesmsuic/VPD
VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model for downstream visual perception tasks.