AKASH2907
CS Ph.D. Student @UCF Center for Research in Computer Vision (CRCV)
University of Central Florida, Orlando, FL
AKASH2907's Stars
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
cxli233/FriendsDontLetFriends
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance.
Breakthrough/PySceneDetect
🎥 Python and OpenCV-based scene cut/transition detection program & library.
microsoft/VideoX
VideoX: a collection of video cross-modal models
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
MarkMoHR/Awesome-Referring-Image-Segmentation
📚 A collection of papers about Referring Image Segmentation.
m-bain/frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
yuantn/MI-AOD
Code for Multiple Instance Active Learning for Object Detection, CVPR 2021
MCG-NJU/EMA-VFI
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
MKLab-ITI/visil
Authors' official PyTorch implementation of "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
antoine77340/S3D_HowTo100M
S3D Text-Video model trained on HowTo100M using MIL-NCE
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
jonbarron/tabilize
Simple code for generating a color-coded LaTeX table from raw data
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
doc-doc/NExT-QA
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
farewellthree/STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
Malitha123/awesome-video-self-supervised-learning
A curated list of awesome self-supervised learning methods in videos
harlanhong/MM2021-CO2-Net
princetonvisualai/multimodal_dataset_distillation
xh-liu/CM-Erase-REG
Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"
jakubmonhart/mil_pytorch
Multiple instance learning model implemented in PyTorch
PKU-ML/CLIP-Help-SimCLR
Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning
MCG-NJU/EVAD
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
zjh31/CPL
micts/acgcn
Code for the paper "Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection"