zjx1277

Pinned Repositories

CrowdCLIP
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
Language:Jupyter Notebook69 6 227
Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Language:Python1.1k 11 40255
PyDIff
[IJCAI 2023 ORAL] "Pyramid Diffusion Models For Low-light Image Enhancement" (Official Implementation)
Language:Python149 3 178
OmAgent
A multimodal agent framework for solving complex tasks
Language:Python383 11 924
OmModel
A collection of strong multimodal models for building multimodal AGI agents
37 4 11
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Language:Python5.8k 64 419401
Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Language:Python185 12 1810
MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python00

zjx1277's Repositories

zjx1277/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python00
zjx1277/UnicomTask
联通手机营业厅自动做任务、签到、领流量、领积分等。