catfish132's Stars
ghsama/ConvTransformerTimeSeries
Convolutional Transformer for time series
catfish132/REAP
this is the repository for REAP
catfish132/DiffusionRRG
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
niehen6174/LVMD
Low-level Vision Model Deployment
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
jacobswan1/ViTCAP
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
LX-doctorAI1/GSKET
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
peng-zhihui/HelloWord-Keyboard
zhjohnchan/awesome-image-captioning
A curated list of image captioning and related area resources. :-)
mad-red/VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
apache/echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
YiwuZhong/Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
Gait3D/Gait3D-Benchmark
This is the code for the paper "Gait Recognition in the Wild with Dense 3D Representations and A Benchmark. (CVPR 2022)", "Gait Recognition in the Wild with Multi-hop Temporal Switch", and "Parsing is All You Need for Accurate Gait Recognition in the Wild".
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch