17Skye17's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
simonw/llm-claude-3
LLM plugin for interacting with the Claude 3 family of models
CiaraStrawberry/svd-temporal-controlnet
hellozhuo/pidinet
Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).
roboflow/supervision
We write your reusable computer vision tools. 💜
GenImage-Dataset/GenImage
iejMac/video2dataset
Easily create large video dataset from video urls
ffhibnese/Model-Inversion-Attack-ToolBox
A comprehensive toolbox for model inversion attacks and defenses, which is easy to get started.
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
AILab-CVC/TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
evalcrafter/EvalCrafter
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
celebv-text/CelebV-Text
(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
yudianzheng/SketchVideo
[EG 2023] Sketch Video Synthesis
aim-uofa/AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
haoningwu3639/StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
MoyGcc/vid2avatar
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
ControlNet/AV-Deepfake1M
[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
wl-zhao/DiffSwap
[CVPR 2023] DiffSwap is a diffusion-based face-swapping framework.
Gourieff/sd-webui-reactor
Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)
Scholar01/sd-webui-mov2mov
This is the Mov2mov plugin for Automatic1111/stable-diffusion-webui.
lisiyao21/AnimeInbet
Code and data for ICCV23 work "Deep Geometrized Cartoon Line Inbetweening"
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
nilslukas/gan-watermark
Watermark for Image Generators
rshaojimmy/MultiModal-DeepFake
[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond