sean-xr

Technical University of MunichMunich

sean-xr's Stars

WalBouss/LeGrad
Language:Python605
deepglint/ALIP
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Language:Python907
hammoudhasan/SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
Language:Python871
fwalch/tum-thesis-latex
:notebook_with_decorative_cover: A LaTeX template for TUM Bachelor/Master theses.
Language:TeX440242
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
Language:Jupyter Notebook792107
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Language:Jupyter Notebook1.8k240
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language:Python64623
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python31.9k4.7k
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.7k1k
ShiArthur03/ShiArthur03
Language:MATLAB10.4k1.9k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.8k452
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language:Python2.3k181
ylaxor/clip-like
Train (fine-tune) OpenAI's CLIP-like models on custom image-caption data sets, cf. COCO dataset. PyTorch implementation.
Language:Jupyter Notebook185
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.3k761
facebookresearch/SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
Language:Python74367
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language:Python2k34
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.1k968
dongliangcao/Unsupervised-Learning-of-Robust-Spectral-Shape-Matching
SIGGRAPH23: Unsupervised Learning of Robust Spectral Shape Matching
Language:Python4910
Pointcept/GPT4Point
[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
Language:Python32220
NVIDIAGameWorks/kaolin
A PyTorch Library for Accelerating 3D Deep Learning Research
Language:Python4.5k553
niladridutt/Diffusion-3D-Features
Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features [CVPR 2024]
Language:Python5910
developer0hye/PyTorch-Deformable-Convolution-v2
Don't feel pain to use Deformable Convolution
Language:Jupyter Notebook32531
OutofAi/2D-Gaussian-Splatting
A 2D Gaussian Splatting paper for no obvious reasons. Enjoy!
Language:Jupyter Notebook38117
tsunghan-wu/SLD
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
Language:Python1526
wimmerth/back-to-3d-few-shot-keypoints
Repository of the CVPR paper "Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected Features".
Language:Python101
gviga/AB-ZoomOut
Language:Python1
szacho/pointcam
Self-supervised adversarial masking for point clouds
Language:Python111
muse1998/Source-Free-Domain-Generalization
An open-world scenario domain generalization code base
Language:Python241
nstucki/Betti-matching
Language:Python443
Colin97/OpenShape_code
official code of “OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding”
Language:Python23916

sean-xr

sean-xr's Stars

WalBouss/LeGrad

deepglint/ALIP

hammoudhasan/SynthCLIP

fwalch/tum-thesis-latex

hila-chefer/Transformer-MM-Explainability

hila-chefer/Transformer-Explainability

lucidrains/transfusion-pytorch

huggingface/pytorch-image-models

facebookresearch/sam2

ShiArthur03/ShiArthur03

OpenGVLab/InternVL

webdataset/webdataset

ylaxor/clip-like

facebookresearch/ImageBind

facebookresearch/SLIP

yuweihao/MambaOut

mlfoundations/open_clip

dongliangcao/Unsupervised-Learning-of-Robust-Spectral-Shape-Matching

Pointcept/GPT4Point

NVIDIAGameWorks/kaolin

niladridutt/Diffusion-3D-Features

developer0hye/PyTorch-Deformable-Convolution-v2

OutofAi/2D-Gaussian-Splatting

tsunghan-wu/SLD

wimmerth/back-to-3d-few-shot-keypoints

gviga/AB-ZoomOut

szacho/pointcam

muse1998/Source-Free-Domain-Generalization

nstucki/Betti-matching

Colin97/OpenShape_code