lzhx171's Stars
yhy-2000/MomentSeeker
dbstjswo505/WMRN
Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval
minghangz/cnm
Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining
foolwood/DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
farewellthree/STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
star-whale/starwhale
an MLOps/LLMOps platform
j-min/HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
layer6ai-labs/xpool
https://layer6ai-labs.github.io/xpool/
whwu95/Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
deezer/spleeter
Deezer source separation library including pretrained models.
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
zju3dv/LoFTR
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
transvcl/TransVCL
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]
fuergaosi233/wechat-chatgpt
Use ChatGPT On Wechat via wechaty
IndustryEssentials/ymir
YMIR, a streamlined model development product.
lyakaap/ISC21-Descriptor-Track-1st
The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
natanielruiz/disrupting-deepfakes
🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
alipay/VCSL
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
wdrink/PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
liangchen527/SLADD
Official code for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection (CVPR 2022 oral)
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
BAAI-WuDao/BriVL
Bridging Vision and Language Model
johanmodin/clifs
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP
nashory/DeLF-pytorch
PyTorch Implementation of "Large-Scale Image Retrieval with Attentive Deep Local Features"
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Tianxiaomo/pytorch-YOLOv4
PyTorch ,ONNX and TensorRT implementation of YOLOv4