ombretta's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
kyegomez/NaViT
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
basnijholt/thesis-cover
Parametrically designing my PhD thesis cover using adaptive sampling, neural networks, and quantum physics
eric-xw/kinetics-i3d-pytorch