soumitri2001
CS Ph.D. student @ UNC Chapel Hill || Researcher @ SRI International || Formerly @ ETS Montreal, SketchX Lab UoSurrey, ISI Kolkata
CS PhD @ UNC Chapel HillChapel Hill, NC, USA
soumitri2001's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
yenchenlin/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
riffusion/riffusion
Stable diffusion for real-time music generation
facebookresearch/vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
AlaaLab/InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
maszhongming/Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
Amshaker/unetr_plus_plus
[IEEE TMI-2024] UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
snap-research/MyVLM
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
mazurowski-lab/finetune-SAM
This is an official repo for fine-tuning SAM to customized medical images.
wenhui0206/NeuroGPT
Code for the paper "Neuro-GPT: Towards a Foundation Model for EEG"
yeerwen/MedCoSS
CVPR 2024 (Highlight)
BioMedIA-MBZUAI/MedPromptX
deep-diver/Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
OmkarThawakar/composed-video-retrieval
Composed Video Retrieval
ShahinaKK/LG_SDG
Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]
HopLee6/Sports-QA
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
ZcsrenlongZ/SelfSVD
[ECCV 2024, Oral] Self-Supervised Video Desmoking for Laparoscopic Surgery
FereshteShakeri/FewShot-CLIP-Strong-Baseline
BioMedIA-MBZUAI/XReal
Shahzadnit/EZ-CLIP
KamitaniLab/PyFastL2LiR
Fast L2-normalized linear regression
rrmina/MLP-Mixer-pytorch
A simple implementation of MLP Mixer in Pytorch