Dufresue's Stars
zhengye1995/kesci-2021-underwater-optics
2021和鲸水下光学目标检测智能算法赛项A榜0.569 B榜0.568
filby89/body-face-emotion-recognition
Code for the paper "Fusing Body Posture with Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction"
filby89/multimodal-emotion-recognition
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
liyupi/yuzi-generator
基于 React + Spring Boot + Picocli + 对象存储的代码生成器共享平台,又分为 3 个循序渐进的子项目:基于命令行的本地代码生成器 + 代码生成器制作工具 + 在线代码生成器平台。实践 Java 命令行应用开发、FreeMarker 模板引擎、多种设计模式、对象存储、十几种优化方法、复杂业务的拆解和系统设计、分布式任务调度系统、Vert.x 响应式编程等
GT-RIPL/Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
pzzhang/VinVL
project page for VinVL
qhfan/RMT
(CVPR2024)RMT: Retentive Networks Meet Vision Transformer
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
transformer-vq/transformer_vq
msracver/Deformable-ConvNets
Deformable Convolutional Networks
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
AykutSarac/jsoncrack.com
✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
airsplay/py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
rmokady/CLIP_prefix_caption
Simple image captioning model
ruotianluo/ImageCaptioning.pytorch
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
karpathy/neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
mad-red/VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
yzfly/CVPR2023_Top_Open_Papers
This repository is a curated collection of the most exciting and influential CVPR 2023 opensource works [Paper + Code].🔥