soumitri2001

CS Ph.D. student @ UNC Chapel Hill || Researcher @ SRI International || Formerly @ ETS Montreal, SketchX Lab UoSurrey, ISI Kolkata

CS PhD @ UNC Chapel HillChapel Hill, NC, USA

soumitri2001's Stars

hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.7k 185 4862.1k
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.2k 99 89758
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6k 45 80535
yenchenlin/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Language:Python5.4k 53 1151.1k
riffusion/riffusion
Stable diffusion for real-time music generation
Language:Python3.3k 38 93380
facebookresearch/vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
Language:Jupyter Notebook3.3k 54 173332
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
1.9k 112 3022
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.7k 50 1388
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
1.3k 38 468
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.2k 21 5448
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
829 52 1435
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
Language:Python684 15 1864
AlaaLab/InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
Language:Python518 32 846
maszhongming/Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
Language:Python430 9 1045
Amshaker/unetr_plus_plus
[IEEE TMI-2024] UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
Language:Python347 5 7732
snap-research/MyVLM
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
Language:Python143 14 58
mazurowski-lab/finetune-SAM
This is an official repo for fine-tuning SAM to customized medical images.
Language:Python118 2 1617
wenhui0206/NeuroGPT
Code for the paper "Neuro-GPT: Towards a Foundation Model for EEG"
Language:Python105 4 422
yeerwen/MedCoSS
CVPR 2024 (Highlight)
Language:Python96 3 82
BioMedIA-MBZUAI/MedPromptX
Language:Jupyter Notebook54 2 11
deep-diver/Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
Language:Jupyter Notebook44 2 16
OmkarThawakar/composed-video-retrieval
Composed Video Retrieval
Language:Python43 2 50
ShahinaKK/LG_SDG
Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]
Language:Jupyter Notebook28 1 11
HopLee6/Sports-QA
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports
27 1 00
ZcsrenlongZ/SelfSVD
[ECCV 2024, Oral] Self-Supervised Video Desmoking for Laparoscopic Surgery
21 2 10
FereshteShakeri/FewShot-CLIP-Strong-Baseline
Language:Python20 2 11
BioMedIA-MBZUAI/XReal
Language:Python18 4 20
Shahzadnit/EZ-CLIP
Language:Python17 1 33
KamitaniLab/PyFastL2LiR
Fast L2-normalized linear regression
Language:Python8 7 02
rrmina/MLP-Mixer-pytorch
A simple implementation of MLP Mixer in Pytorch
Language:Python6 3 05

soumitri2001

soumitri2001's Stars

hpcaitech/Open-Sora

facebookresearch/ImageBind

facebookresearch/DiT

yenchenlin/nerf-pytorch

riffusion/riffusion

facebookresearch/vissl

layerdiffusion/LayerDiffuse

ChenHsing/Awesome-Video-Diffusion-Models

yunlong10/Awesome-LLMs-for-Video-Understanding

FoundationVision/LlamaGen

DirtyHarryLYL/LLM-in-Vision

allenai/visprog

AlaaLab/InstructCV

maszhongming/Multi-LoRA-Composition

Amshaker/unetr_plus_plus

snap-research/MyVLM

mazurowski-lab/finetune-SAM

wenhui0206/NeuroGPT

yeerwen/MedCoSS

BioMedIA-MBZUAI/MedPromptX

deep-diver/Vid2Persona

OmkarThawakar/composed-video-retrieval

ShahinaKK/LG_SDG

HopLee6/Sports-QA

ZcsrenlongZ/SelfSVD

FereshteShakeri/FewShot-CLIP-Strong-Baseline

BioMedIA-MBZUAI/XReal

Shahzadnit/EZ-CLIP

KamitaniLab/PyFastL2LiR

rrmina/MLP-Mixer-pytorch