DuoPeng-CVer's Stars
zhengxuJosh/360SFUDA
Code for Panoramic Semantic Segmentation
Tramac/awesome-semantic-segmentation-pytorch
Semantic Segmentation on PyTorch (includes FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)
yunlongdong/FCN-pytorch
Another pytorch implementation of FCN (Fully Convolutional Networks)
pochih/FCN-pytorch
🚘 Easiest Fully Convolutional Networks
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
luciddreamer-cvlab/LucidDreamer
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
GONGJIA0208/Diffpose
[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation
GONGJIA0208/Diffpose_video
Code for the DiffPose video setting
meta-llama/llama
Inference code for Llama models
WUSTL-CSPL/RIATIG
Jaykef/AvaChat
AvaChat is a realtime AI chat demo with animated talking heads. It uses Large Language Models (GPT, API2D GPT4, Claude) as text inputs to D-ID's image-to-video talking head model (via the D-ID stream API)
yu-takagi/StableDiffusionReconstruction
Takagi and Nishimoto, CVPR 2023
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
yuval-alaluf/Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
benkyoujouzu/stable-diffusion-webui-visualize-cross-attention-extension
CompVis/stable-diffusion
A latent text-to-image diffusion model
shape-guided-diffusion/shape-guided-diffusion
Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024
SusungHong/Self-Attention-Guidance
The implementation of the paper "Improving Sample Quality of Diffusion Models Using Self-Attention Guidance" (ICCV'23)
google/prompt-to-prompt
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
cvlab-stonybrook/LearningToCountEverything
luminxu/Pose-for-Everything
The official repo for ECCV'22 paper: Pose for Everything: Towards Category-Agnostic Pose Estimation
52CV/CVPR-2023-Papers
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
nakaizura/Awesome-Cross-Modal-Video-Moment-Retrieval
Continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, and video clip retrieval.
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
changdaeoh/BlackVIP
Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"
nagadomi/waifu2x
Image Super-Resolution for Anime-Style Art
Viliami/kmeans-cluster
Python visualization of k-means clustering