cvpr2023

There are 145 repositories under cvpr2023 topic.

  • amusi/CVPR2024-Papers-with-Code

    CVPR 2024 论文和开源项目合集

  • OpenTalker/SadTalker

    [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

    Language:Python10.8k1467872k
  • baaivision/Painter

    Painter & SegGPT Series: Vision Foundation Models from BAAI

    Language:Python2.4k3665159
  • VainF/Torch-Pruning

    [CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

    Language:Python2.4k32316301
  • tinyvision/SOLIDER

    A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent

    Language:Python1.9k13025341
  • DWCTOD/CVPR2024-Papers-with-Code-Demo

    收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!

  • henghuiding/ReLA

    [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation

    Language:Python65052315
  • Weizhi-Zhong/IP_LAP

    CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

    Language:Python607185468
  • IDEA-Research/OSX

    [CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"

    Language:Python5841612949
  • openscene

    pengsongyou/openscene

    [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

    Language:Python566197744
  • ChenFengYe/motion-latent-diffusion

    [CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

    Language:Python52495548
  • gangweiX/IGEV

    [CVPR 2023] Iterative Geometry Encoding Volume for Stereo Matching and Multi-View Stereo

    Language:Python476296958
  • ankanbhunia/PIDM

    Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)

    Language:Jupyter Notebook468226556
  • rayleizhu/BiFormer

    [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"

    Language:Python43344834
  • OpenGVLab/VideoMAEv2

    [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

    Language:Python43064942
  • nihaomiao/CVPR23_LFDM

    The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

    Language:Python425114541
  • ZikangZhou/QCNet

    [CVPR 2023] Query-Centric Trajectory Prediction

    Language:Python398134061
  • youngLBW/HRN

    [CVPR2023] A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images.

    Language:Python394236137
  • GAP-LAB-CUHK-SZ/MVImgNet

    CVPR2023 | MVImgNet: A Large-scale Dataset of Multi-view Images

    Language:Python3538304
  • SysCV/MaskFreeVIS

    Mask-Free Video Instance Segmentation [CVPR 2023]

    Language:Python35381624
  • alibaba/lightweight-neural-architecture-search

    This is a collection of our zero-cost NAS and efficient vision applications.

    Language:Python349123143
  • fabiotosi92/NeRF-Supervised-Deep-Stereo

    A novel paradigm for collecting and generating stereo training data using neural rendering

    Language:Python341164718
  • Haiyang-W/DSVT

    [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"

    Language:Python33897327
  • nianticlabs/ace

    [CVPR 2023 - Highlight] Accelerated Coordinate Encoding (ACE): Learning to Relocalize in Minutes using RGB and Poses

    Language:Python328122930
  • kxhit/vMAP

    [CVPR 2023] vMAP: Vectorised Object Mapping for Neural Field SLAM

    Language:Python325142220
  • hzwer/CVPR2023-DMVFN

    CVPR2023 (highlight) - A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

    Language:Jupyter Notebook3198149
  • theEricMa/OTAvatar

    This is the official repository for OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering [CVPR2023].

    Language:Python297112933
  • DmitryRyumin/CVPR-2023-24-Papers

    CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

    Language:Python2927019
  • dvlab-research/SphereFormer

    The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).

    Language:Python28257133
  • MendelXu/SAN

    Open-vocabulary Semantic Segmentation

    Language:Python27865525
  • yiqun-wang/PET-NeuS

    PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces (CVPR 2023)

    Language:Python25419165
  • Brummi/BehindTheScenes

    Official implementation of the paper: Behind the Scenes: Density Fields for Single View Reconstruction (CVPR 2023)

    Language:Python234133217
  • PJLab-ADG/LoGoNet

    [CVPR2023] LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

    Language:Python23464115
  • CVMI-Lab/PLA

    (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

    Language:Python220134211
  • Advocate99/DiffGesture

    [CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

    Language:Python212122415
  • MCG-NKU/AMT

    Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)

    Language:Python20561716