Pinned Repositories
LLM_X_papers
Continually-updated reading list of LLM papers in Finance, Healthcare, and Law
Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
awesome-radiology-report-generation
A curated list of radiology report generation (medical report generation) and related areas. :-)
CosRec
Code for CosRec: 2D Convolutional Neural Networks for Sequential Recommendation (CIKM-19)
Gest
A multimodal dataset for google map restaurants.
MM-Navigator
GPT-4V in Wonderland: LMMs as Smartphone Agents
RadBERT
Code and models for Paper RadBERT: Adapting transformer-based language models to radiology
SoM-LLaVA
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
WCL
Code for Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation (EMNLP-21)
XL-VLN
Dataset for Bilingual VLN
zzxslp's Repositories
zzxslp/MM-Navigator
GPT-4V in Wonderland: LMMs as Smartphone Agents
zzxslp/SoM-LLaVA
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
zzxslp/CosRec
Code for CosRec: 2D Convolutional Neural Networks for Sequential Recommendation (CIKM-19)
zzxslp/WCL
Code for Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation (EMNLP-21)
zzxslp/XL-VLN
Dataset for Bilingual VLN
zzxslp/Gest
A multimodal dataset for google map restaurants.
zzxslp/RadBERT
Code and models for Paper RadBERT: Adapting transformer-based language models to radiology
zzxslp/Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
zzxslp/awesome-radiology-report-generation
A curated list of radiology report generation (medical report generation) and related areas. :-)
zzxslp/clinicalBERT
repository for Publicly Available Clinical BERT Embeddings
zzxslp/constrained_decoding
Lexically constrained decoding for sequence generation using Grid Beam Search
zzxslp/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
zzxslp/fluent-python
《流畅的Python》2015年8月
zzxslp/gancaption_iccv2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN
zzxslp/hello-world
Just a repository
zzxslp/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
zzxslp/pytorch-sgns
Skipgram Negative Sampling in PyTorch
zzxslp/Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
zzxslp/SoM
Set-of-Mark Prompting for LMMs
zzxslp/state-spaces
Sequence Modeling with Structured State Spaces
zzxslp/testimg
zzxslp/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
zzxslp/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
zzxslp/zzxslp.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes