JialeCao001's Stars
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
JialianW/TraDeS
Track to Detect and Segment: An Online Multi-Object Tracker (CVPR 2021)
mbzuai-oryx/XrayGPT
[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
lahoud/3d-vision-transformers
A list of 3D computer vision papers with Transformers
JialeCao001/SipMask
SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation (ECCV 2020)
JialianW/GRiT
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
JialeCao001/PedSurvey
From Handcrafted to Deep Features for Pedestrian Detection: A Survey (TPAMI 2021)
xb534/SED
[CVPR 2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.
Leotju/MGAN
Mask-Guided Attention Network for Occluded Pedestrian Detection (ICCV'19)
mbzuai-oryx/ClimateGPT
[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabic.
JialeCao001/HSD
Hierarchical Shot Detector (ICCV 2019)
cp3wan/DFormer
JialianW/Forest_RCNN
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)
zwq456/CLIP-VIS
[IEEE TCSVT] Official PyTorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.
linsun449/iseg.code
This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.
megvii-research/Co-mining
Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection, AAAI 2021.
linsun449/cliper.code
This repo is the official PyTorch implementation of the paper CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation.