xwsgithub's Stars
NLPJCL/RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
x-cls/superclass
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
deepglint/ALIP
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
lucidrains/x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
ant-research/DreamLIP
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
anderskm/gputil
A Python module for getting the GPU status from NVIDA GPUs using nvidia-smi programmically in Python
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
timothybrooks/instruct-pix2pix
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
ZhouKanglei/Awesome-AQA
Awesome Action Quality Assessment (AQA)
hassony2/kinetics_i3d_pytorch
Inflated i3d network with inception backbone, weights transfered from tensorflow
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Youngluc/CBLIP2
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
forthespada/CampusShame
互联网仍有记忆!那些曾经在校招过程中毁过口头offer、意向书、三方的公司!纵然人微言轻,也想尽绵薄之力!
Lyman-Smoker/Awesome-AQA
A curated list of Action Quality Assessment and related area resources
luca-medeiros/lang-segment-anything
SAM with text prompt
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
hqu-cst-mmc/PCLN
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
ParitoshParmar/Fitness-AQA
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
yuxumin/CoRe
[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment
baiyang4/aqa_tpt
implementation of "Action Quality Assessment with Temporal Parsing Transformer"
Luciferbobo/DAE-AQA
Auto-Encoding Score Distribution Regression for Action Quality Assessment
xuangch/CVPR22_GDLT
The code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".