Pinned Repositories
ml-aim
This repository provides the code and model checkpoints for the AIMv1 and AIMv2 research projects.
S3Diff
Official implementation of S3Diff
CACNet-Pytorch
Unofficial PyTorch implementation of "Composing Photos Like a Photographer"
AVG-LLaVA
Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"
gencrop
Code for Learning Subject-Aware Cropping by Outpainting Professional Photos
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Q-Align
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can be efficiently fine-tuned on downstream datasets.
GalleryGPT
S2CNet
Official PyTorch implementation of “Spatial-Semantic Collaborative Cropping for User Generated Content” (AAAI24).
OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
dongdk's Repositories
dongdk doesn’t have any repository yet.