HY-Wong

HY-Wong's Stars

NVlabs/ffhq-dataset
Flickr-Faces-HQ Dataset (FFHQ)
Language:Python3.8k587
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python5k349
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.5k994
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.3k3.4k
MILVLG/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
Language:Jupyter Notebook29476
Link-Li/CLMLF
Language:Python738
IsaacBravo/streamlit-app
This is an interactive app that allow users play around with the clip model to analyze images
Language:Python31
zhutong0219/ITIN
Multimodal Sentiment Analysis with Image-Text Interaction Network
Language:Python12