LOOKCC's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Stability-AI/generative-models
Generative Models by Stability AI
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Stability-AI/StableCascade
Official Code for Stable Cascade
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
YaoFANGUK/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
openai/consistencydecoder
Consistency Distilled Diff VAE
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
magic-research/magic-edit
MagicEdit: High-Fidelity Temporally Coherent Video Editing
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
AIrjen/OneButtonPrompt
One Button Prompt
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
hzwer/Practical-RIFE
More practical frame interpolation approach.
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
MCG-NJU/EMA-VFI
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio
kata198/func_timeout
Python module which allows you to specify timeouts when calling any existing function, and support for stoppable threads
JialianW/GRiT
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
Zeqiang-Lai/Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
luxiangju-PersonAI/iCartoonFace
iCartoonFace dataset, and baseline approaches, the project is supported by iQIYI
Karine-Huang/T2I-CompBench
[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
tjm35/asymmetric-tiling-sd-webui
Asymmetric Tiling for stable-diffusion-webui
valeoai/Maskgit-pytorch
unofficial MaskGIT reproduction in PyTorch
Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
SmilingWolf/SW-CV-ModelZoo
Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset
vicuna-tools/Stablediffy
Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion
shangwei5/VIDUE
Joint Video Multi-Frame Interpolation and Deblurring under Unknown Exposure Time (CVPR2023)
openai/dalle3-eval-samples
Text-to-image samples collected for the evaluation of DALL-E 3 in the whitepaper.
RassilonSleeps/MagicPrompt-SD
Web UI for Stable Diffusion prompt generation via GPT-2 trained model