Pinned Repositories
adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
aqa_tpt
Implementation of "Action Quality Assessment with Temporal Parsing Transformer"
BilibiliVideoDownload
Cross-platform desktop software for downloading Bilibili videos; supports Windows, macOS, and Linux
Bridge-Prompt
[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
chat-project-based-on-ubuntu
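A small chat project for Ubuntu, implemented in C++ with a client/server (C/S) architecture. It supports registration, login, login-state tracking, private chat, and group chat. The concurrent server was first implemented with multithreading and later with the Reactor pattern (epoll for event monitoring plus a thread pool for processing). The server was stress-tested, and a bitmap-based Bloom filter reduces queries to MySQL. The project uses TCP network programming for client/server communication, MySQL to store user accounts and passwords, and Redis to record users' login state. A makefile handles compilation, shell scripts were used to speed up development, git was used for version control, and documentation was written.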
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
CoRe
[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment
dalle2-laion
Pretrained Dalle2 from laion
Leetcode-Problemlist
z-w-wang's Repositories
z-w-wang/Leetcode-Problemlist
z-w-wang/adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
z-w-wang/aqa_tpt
Implementation of "Action Quality Assessment with Temporal Parsing Transformer"
z-w-wang/BilibiliVideoDownload
Cross-platform desktop software for downloading Bilibili videos; supports Windows, macOS, and Linux
z-w-wang/Bridge-Prompt
[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
z-w-wang/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
z-w-wang/CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
z-w-wang/dalle2-laion
Pretrained Dalle2 from laion
z-w-wang/DeVLBert
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
z-w-wang/DTLR
Handwritten Text Recognition and Character Detection
z-w-wang/FineDiving
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
z-w-wang/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
z-w-wang/long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
z-w-wang/Machine-Learning
Principles of machine learning
z-w-wang/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
z-w-wang/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
z-w-wang/MTL-AQA
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
z-w-wang/Online-Action-Detection
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.
z-w-wang/QRNet
z-w-wang/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
z-w-wang/rulstm
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention. International Conference on Computer Vision, 2019.
z-w-wang/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
z-w-wang/stochastic-backpropagation
Some settings for LSTR
z-w-wang/SVIP-Sequence-VerIfication-for-Procedures-in-Videos
[CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos
z-w-wang/TeSTra
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
z-w-wang/TSA-Net
[ACM MM 2021] TSA-Net: Tube Self-Attention Network for Action Quality Assessment
z-w-wang/TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
z-w-wang/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
z-w-wang/VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
z-w-wang/WSVOG_Causal_Intervention
Weakly-Supervised Video Object Grounding via Causal Intervention (IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023)