/Awesome-Anything

AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask

Awesome-Anything

Awesome Anything

A curated list of general AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, etc.

Contributions are welcome!

AnyObject

Title & Authors Intro Useful Links
Segment Anything
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alex Berg, Wan-Yen Lo, Piotr Dollar, Ross Girshick
Preprint'23

[Segment Anything (Project)]
intro [Github]
[Page]
[Demo]
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu and Zhaoyang Zeng and Tianhe Ren and Feng Li and Hao Zhang and Jie Yang and Chunyuan Li and Jianwei Yang and Hang Su and Jun Zhu and Lei Zhang
Preprint'23

[Grounded-SAM, GroundingDINO (Project)]
intro [Github]
[Demo]
SegGPT: Segmenting Everything In Context
Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang
Preprint'23

[SegGPT (Project)]
image [Github]
V3Det: Vast Vocabulary Visual Detection Dataset
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin
Preprint'23
image --
segment-anything-video (Project)
Kadir Nar
intro [Github]
Towards Segmenting Anything That Moves
Achal Dave, Pavel Tokmakov, Deva Ramanan
ICCV'19 Workshop

[segment-any-moving (Project)]
[Github]

AnyGeneration

Title & Authors Intro Useful Links
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer
CVPR'22

[Stable-Diffusion (Project)]
intro [Github]
[Page]
[Demo]
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Maneesh Agrawala
Preprint'23

[ControlNet (Project)]
intro [Github]
[Demo]
GigaGAN: Large-scale GAN for Text-to-Image Synthesis
Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park
CVPR'23
image [Page]
Inpaint-Anything: Segment Anything Meets Image Inpainting (Project)
Tao Yu
intro [Github]

AnyModel

Title & Authors Intro Useful Links
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
Preprint'23

[Jarvis (Project)]
[Github]
[Demo]
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan Preprint'23
intro [Github]
DepGraph: Towards Any Structural Pruning
Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
CVPR'23

[Torch-Pruning (Project)]
intro [Github]
[Demo]
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li and Mingzhu Shen and Jian Ma and Yan Ren and Mingxin Zhao and Qi Zhang and Ruihao Gong and Fengwei Yu and Junjie Yan
NeurIPS'21

[MQBench (Project)]
intro [Github]
[Page]
OTOv2: Automatic, Generic, User-Friendly
Tianyi Chen, Luming Liang, Tianyu Ding, Ilya Zharkov
ICLR'23

[Only Train Once (Project)]
intro [Github]
Deep Model Reassembly
Xingyi Yang, Daquan Zhou, Songhua Liu, Jingwen Ye, Xinchao Wang
NeurIPS'22

[Deep Model Reassembly (Project)]
intro [Github]
[Page]

AnyTask

Title & Authors Intro Useful Links
Generalized Decoding for Pixel, Image and Language
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao
CVPR'23

[X-Decoder (Project)]
intro [Github]
[Page]
[Demo]
Pre-Trained Image Processing Transformer
Chen, Hanting and Wang, Yunhe and Guo, Tianyu and Xu, Chang and Deng, Yiping and Liu, Zhenhua and Ma, Siwei and Xu, Chunjing and Xu, Chao and Gao, Wen
CVPR'21

[Pretrained-IPT (Project)]
intro [Github]

AnyX

...