Pinned Repositories
CrowdCLIP
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
PyDIff
[IJCAI 2023 ORAL] "Pyramid Diffusion Models For Low-light Image Enhancement" (Official Implementation)
OmAgent
A multimodal agent framework for solving complex tasks
OmModel
A collection of strong multimodal models for building multimodal AGI agents
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
zjx1277's Repositories
zjx1277/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
zjx1277/UnicomTask
联通手机营业厅自动做任务、签到、领流量、领积分等。