MarcosRodrigoT's Stars
akiomik/vimeo-dl
A cli tool to download private videos on vimeo. Written in golang.
nlpfromscratch/webinars
milvus-io/bootcamp
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
karpathy/ng-video-lecture
kelseyhightower/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way. No scripts.
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
EasonXiao-888/UVCOM
[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
EdenGabriel/TaskWeave
[CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
thswodnjs3/CSTA
The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"
amathislab/wildclip
Scene and animal attribute retrieval from camera trap data with domain-adapted vision-language models
CVIR/TCL
Semi-Supervised Action Recognition with Temporal Contrastive Learning
antoine77340/MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
TencentARC/UMT
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
rasbt/machine-learning-book
Code Repository for Machine Learning with PyTorch and Scikit-Learn
wjun0830/QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
RaivoKoot/Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
kubeflow/training-operator
Distributed ML Training and Fine-Tuning on Kubernetes
HopLee6/SSPVS-PyTorch
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
wagoodman/dive
A tool for exploring each layer in a docker image
Xpra-org/xpra
Persistent remote applications for X11; screen sharing for X11, MacOS and MSWindows.
mlfoundations/open_clip
An open source implementation of CLIP.
princeton-vl/RAFT
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
qianqianwang68/omnimotion
boheumd/A2Summ
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
medhini/Instructional-Video-Summarization
Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022
v-iashin/BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image