FuLy2002's Stars
wangzhifengharrison/HTNet
wangyuchi369/LaDiC
[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?
krahets/hello-algo
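Hello Algo (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is ongoing.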
drboog/Shifted_Diffusion
Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023)
jianjieluo/PCM-Net
[ECCV24] Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
ailab-kyunghee/CM2_DVC
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
SatyamGaba/image_captioning
Image Captioning with CNN, LSTM and RNN using PyTorch on COCO Dataset
jianjieluo/SCD-Net
[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades the conventional diffusion model with an additional semantic prior.
aanna0701/SPT_LSA_ViT
Implementation of Visual Transformer for Small-size Datasets
whwu95/Cap4Video
[CVPR 2023 Highlight & TPAMI] Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
liujf69/EPP-Net-Action
[TRIT 2024] Implementation of the paper “Explore Human Parsing Modality for Action Recognition”.
YashiGoyal-02/Smart-Bridge-Yoga-Pose-Classification
The project "Yoga Pose Classification" aims to develop a system which can classify various yoga poses from static images and real-time poses captured through a camera.
frank-1150/frank-1150.github.io
An attempt to clone Tesla.com using HTML, JS, and CSS
rishikksh20/ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
uark-cviu/Micron-BERT
[CVPR 2023] Micron-BERT: BERT-based Facial Micro-Expression Recognition
mingyuefusu/library_manage_system
A fully featured library system built with JSP, layui, and MySQL, including interfaces for user book borrowing, librarians, and system administrators
GroverZhu/Online-Library-System
An online library management system based on the MVC design pattern
ahangchen/torch_base
Quickly bring up your PyTorch project (a skeleton)
songquanpeng/pytorch-template
To be the world's best PyTorch project template.
leftatrium2/AIDemo
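Deep learning and CV example projects focused on the human body, including pose skeletons, hand skeletons, pose recognition, gesture recognition, face detection, and more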
IW276/IW276WS20-P12
2D Pose Based Action Recognition
kenshohara/3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
Rahul5430/Speech-Emotion-Recognition-System
A system that classifies audio speech files into emotions such as happy, sad, angry, and neutral. Speech emotion recognition (SER) can be used in areas such as the medical field or customer call centers.
CS-BAOYAN/CS-BAOYAN-2023