lzhx171

DLUT

lzhx171's Stars

yhy-2000/MomentSeeker
Language:Python17
dbstjswo505/WMRN
Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval
Language:Python6612
minghangz/cnm
Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining
Language:Python294
foolwood/DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
Language:Python965
farewellthree/STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
Language:Python1064
star-whale/starwhale
an MLOps/LLMOps platform
Language:Java23439
j-min/HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
Language:Python10511
layer6ai-labs/xpool
https://layer6ai-labs.github.io/xpool/
Language:Python12710
whwu95/Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Language:Python24417
deezer/spleeter
Deezer source separation library including pretrained models.
Language:Python27.4k3k
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python13.8k3k
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python69.2k8.4k
zju3dv/LoFTR
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Language:Jupyter Notebook2.7k390
transvcl/TransVCL
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]
Language:Python557
fuergaosi233/wechat-chatgpt
Use ChatGPT On Wechat via wechaty
Language:TypeScript13.3k3.8k
IndustryEssentials/ymir
YMIR, a streamlined model development product.
Language:Python593161
lyakaap/ISC21-Descriptor-Track-1st
The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.
Language:Python14119
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
Language:Python35428
natanielruiz/disrupting-deepfakes
🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
Language:Python34149
alipay/VCSL
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]
Language:Python12818
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python12.2k1.9k
wdrink/PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
Language:Python11316
liangchen527/SLADD
Official code for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection (CVPR 2022 oral)
Language:Python14016
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook5.5k717
BAAI-WuDao/BriVL
Bridging Vision and Language Model
Language:Python28431
johanmodin/clifs
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP
Language:JavaScript47653
nashory/DeLF-pytorch
PyTorch Implementation of "Large-Scale Image Retrieval with Attentive Deep Local Features"
Language:Jupyter Notebook35063
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook30.7k3.7k
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Language:Python968106
Tianxiaomo/pytorch-YOLOv4
PyTorch ,ONNX and TensorRT implementation of YOLOv4
Language:Python4.5k1.5k

lzhx171

lzhx171's Stars

yhy-2000/MomentSeeker

dbstjswo505/WMRN

minghangz/cnm

foolwood/DRL

farewellthree/STAN

star-whale/starwhale

j-min/HiREST

layer6ai-labs/xpool

whwu95/Cap4Video

deezer/spleeter

PaddlePaddle/PaddleDetection

binary-husky/gpt_academic

zju3dv/LoFTR

transvcl/TransVCL

fuergaosi233/wechat-chatgpt

IndustryEssentials/ymir

lyakaap/ISC21-Descriptor-Track-1st

facebookresearch/sscd-copy-detection

natanielruiz/disrupting-deepfakes

alipay/VCSL

PaddlePaddle/PaddleSpeech

wdrink/PyDeepFakeDet

liangchen527/SLADD

salesforce/BLIP

BAAI-WuDao/BriVL

johanmodin/clifs

nashory/DeLF-pytorch

openai/CLIP

YehLi/xmodaler

Tianxiaomo/pytorch-YOLOv4