Pinned Repositories
2015-terminal-interview-git
SYSU Apple Club - 2015 - Terminal Department - Second Round Interview - Git Learning
aphantasia
CLIP + FFT/DWT/RGB = text to image/video
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
books
useful books
BoxDiff
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
CLIP
Contrastive Language-Image Pretraining
Crop-CLIP
Crop using CLIP
Distributed-PC-Darts
Distributed implementation of PC-DARTS. This code is based on the original PC-DARTS implementation and supports searching and training across multiple nodes and GPUs using distributed data parallel. Only distributed search and retraining on CIFAR-10 are implemented; you can modify it for your own datasets.
glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
SalMetric
A Python program for evaluating saliency map results
yangbinb's Repositories
yangbinb/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
yangbinb/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
yangbinb/BoxDiff
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
yangbinb/CLIP_prefix_caption
Simple image captioning model
yangbinb/CogVideo
Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
yangbinb/CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
yangbinb/ComfyUI-DragNUWA
yangbinb/ComfyUI-Marigold
Marigold depth estimation in ComfyUI
yangbinb/CVPR23_LFDM
The PyTorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
yangbinb/DirectInversion
Official repo for paper "Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
yangbinb/Director3D
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".
yangbinb/DragNUWA
yangbinb/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
yangbinb/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion
yangbinb/lorahub
yangbinb/mistral-src
Reference implementation of the Mistral AI 7B v0.1 model.
yangbinb/Omost
Your image is almost there!
yangbinb/Open-Sora
Building your own video generation model like OpenAI's Sora
yangbinb/rich-text-to-image
Rich-Text-to-Image Generation
yangbinb/stable-diffusion
yangbinb/svd-temporal-controlnet
yangbinb/T-Rex
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
yangbinb/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
yangbinb/TrackDiffusion
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
yangbinb/TTNet-Real-time-Analysis-System-for-Table-Tennis-Pytorch
Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)
yangbinb/vector-quantize-pytorch
Vector Quantization, in PyTorch
yangbinb/Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
yangbinb/vidmaestro.github.io
yangbinb/WaveDiff
Official PyTorch implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
yangbinb/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)