Pinned Repositories
ALBEF
Code for ALBEF: a new vision-language pre-training method
Bert
Original BERT code
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
CAMERA
DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
FNE
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.
HAT
Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'
MM-Forecast
Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models".
NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
pvse
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
LuminosityX's Repositories
LuminosityX/HAT
Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'
LuminosityX/FNE
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.
LuminosityX/MM-Forecast
Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models".
LuminosityX/pvse
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
LuminosityX/CAMERA
LuminosityX/NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
LuminosityX/ALBEF
Code for ALBEF: a new vision-language pre-training method
LuminosityX/Bert
Original BERT code
LuminosityX/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
LuminosityX/DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
LuminosityX/llama
Inference code for LLaMA models
LuminosityX/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
LuminosityX/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
LuminosityX/MIC
MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU
LuminosityX/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
LuminosityX/MMT-Retrieval