beautifulchoi's Stars
mlfoundations/open_clip
An open source implementation of CLIP.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
facebookresearch/theseus
A library for differentiable nonlinear optimization
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Sense-X/UniFormer
[ICLR2022] official implementation of UniFormer
v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
haofanwang/video-swin-transformer-pytorch
Video Swin Transformer - PyTorch
gaomingqi/Awesome-Video-Object-Segmentation
:bookmark: Curated list of video object segmentation (VOS) papers, datasets, and projects.
Gy920/segment-anything-2-real-time
Run Segment Anything Model 2 on a live video stream
robo-alex/awesome-scene-representation
A curated list of awesome scene representation(NeRFs) papers, code, and resources.
facebookresearch/videoalignment
Learning to align and match videos with kernelized temporal layers
TengdaHan/TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
Malitha123/awesome-video-self-supervised-learning
A curated list of awesome self-supervised learning methods in videos
qimaqi/ShapeSplat-Gaussian_MAE
Offical implementation of work: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining
yd-yin/SAI3D
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
hadjisma/VideoAlignment
pengsongyou/lseg_feature_extraction
Wang-pengfei/GGSD
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
hyunji12/Open3DRF
drivendataorg/video-similarity-challenge
Links to winning solutions for the Meta AI Video Similarity Challenge
trquhuytin/LAV-CVPR21
Video-MAC/VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
Kitsunetic/kitsu
yogendra-yatnalkar/SAM-Promptless-Task-Specific-Finetuning
Promtless-TaskSpecific-Finetuning of MetaAI Segment-Anything Model
TuBui/deep_image_comparator
Code for CVPR WMF 2021 paper "Deep Image Comparator: Learning to Visualize Editorial Change"
Kitsunetic/docker-server-notion
서버 도커 관리용... 서버좀 깨끗하게 씁시다