ZhuangPeiyu

Shenzhen University, Shenzhen

ZhuangPeiyu's Stars

ucaslcl/Fox
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
Language:Python1006
quqxui/Awesome-LLM4IE-Papers
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
65937
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language:Python74435
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python · 19.5k · 2.5k
rshaojimmy/MultiModal-DeepFake
[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond
Language:Python33025
scu-zjz/IMDLBenCo
A comprehensive benchmark & codebase for Image manipulation detection/localization.
Language:Python425
qcf-568/MIML
[CVPR2024] Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods
Language:Python301
OpenGVLab/MM-NIAH
This is the official implementation of the paper "Needle In A Multimodal Haystack"
Language:Python724
greatzh/Papers
Image Forgery Detection and Localization (and related) Papers List
Language:HTML24821
Ekko-zn/AIGCDetectBenchmark
Language:Python19220
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language: Python · 1.6k · 84
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python · 6k · 533
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
Language:Jupyter Notebook35420
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue large language model)
Language: HTML · 7.8k · 753
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language: Python · 1.3k · 84
OSU-NLP-Group/TableLlama
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
Language:Python1038
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Language: Python · 1.7k · 151
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language: Python · 6.2k · 649
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
Language:Python55226
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Language: Python · 2.7k · 170
IDEA-CCNL/Fengshenbang-LM
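Fengshenbang-LM (the Fengshenbang large-model project) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.
Language: Python · 4k · 375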
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook · 9.6k · 941
YuchenLiu98/COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
1815
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Language: Python · 2.6k · 297
clpeng/Awesome-Face-Forgery-Generation-and-Detection
A curated list of articles and codes related to face forgery generation and detection.
688113
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
11.7k · 758
GenImage-Dataset/GenImage
Language:Python30027
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python · 38.4k · 5k
ZhendongWang6/DIRE
[ICCV 2023] Official implementation of the paper: "DIRE for Diffusion-Generated Image Detection"
Language:Python25420
lllyasviel/ControlNet
Let us control diffusion models!
Language: Python · 29.8k · 2.7k
