Dufresue

Dufresue's Stars

zhengye1995/kesci-2021-underwater-optics
2021和鲸水下光学目标检测智能算法赛项A榜0.569 B榜0.568
Language:Python8523
filby89/body-face-emotion-recognition
Code for the paper "Fusing Body Posture with Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction"
Language:Python195
filby89/multimodal-emotion-recognition
Language:Python42
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Language:C++30.9k7.8k
liyupi/yuzi-generator
基于 React + Spring Boot + Picocli + 对象存储的代码生成器共享平台，又分为 3 个循序渐进的子项目：基于命令行的本地代码生成器 + 代码生成器制作工具 + 在线代码生成器平台。实践 Java 命令行应用开发、FreeMarker 模板引擎、多种设计模式、对象存储、十几种优化方法、复杂业务的拆解和系统设计、分布式任务调度系统、Vert.x 响应式编程等
Language:Java487105
GT-RIPL/Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Language:Python6010
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
Language:Python49035
pzzhang/VinVL
project page for VinVL
34925
qhfan/RMT
(CVPR2024)RMT: Retentive Networks Meet Vision Transformer
Language:Python27418
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Language:Python2.3k265
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python133k26.5k
transformer-vq/transformer_vq
Language:Python17412
msracver/Deformable-ConvNets
Deformable Convolutional Networks
Language:Python4k959
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language:Python13.7k2k
AykutSarac/jsoncrack.com
✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.
Language:TypeScript30.6k1.9k
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language:Python3.8k442
airsplay/py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
Language:Python22956
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language:Python516136
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Language:Python17827
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.7k953
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Language:Python4.1k416
rmokady/CLIP_prefix_caption
Simple image captioning model
Language:Jupyter Notebook1.3k214
ruotianluo/ImageCaptioning.pytorch
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
Language:Python1.4k412
karpathy/neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Language:Python5.4k1.3k
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language:Jupyter Notebook1.4k379
mad-red/VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
Language:Python364
aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Language:Python28261
sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Language:Python2.8k711
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
17.9k2.6k
yzfly/CVPR2023_Top_Open_Papers
This repository is a curated collection of the most exciting and influential CVPR 2023 opensource works [Paper + Code].🔥
Language:HTML605