catfish132

catfish132's Stars

ghsama/ConvTransformerTimeSeries
Convolutional Transformer for time series
Language:Python26153
catfish132/REAP
This is the repository for REAP.
Language: Python 5
catfish132/DiffusionRRG
Language:Python102
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Language: Python 1k 111
niehen6174/LVMD
Low-level Vision Model Deployment
Language:C++102
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language: HTML 10.4k 906
jacobswan1/ViTCAP
Implementation for CVPR 2022 paper "Injecting Semantic Concepts into End-to-End Image Captioning".
Language:Python411
LX-doctorAI1/GSKET
Language:Python294
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Language: Jupyter Notebook 23.5k 3.1k
peng-zhihui/HelloWord-Keyboard
Language: C 5.9k 965
zhjohnchan/awesome-image-captioning
A curated list of image captioning and related area resources. :-)
1.1k 184
mad-red/VSR-guided-CIC
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
Language:Python364
apache/echarts
Apache ECharts is a powerful, interactive charting and data visualization library for the browser
Language: TypeScript 59.6k 19.6k
YiwuZhong/Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
Language:Jupyter Notebook9315
Gait3D/Gait3D-Benchmark
This is the code for the paper "Gait Recognition in the Wild with Dense 3D Representations and A Benchmark. (CVPR 2022)", "Gait Recognition in the Wild with Multi-hop Temporal Switch", and "Parsing is All You Need for Accurate Gait Recognition in the Wild".
Language:Python12818
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
Language:Python89082
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Language: Python 18.7k 2.9k