hengrui0516's Stars
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
zyang-ur/idea2img
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024
opendatalab/UrBench
Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
WisconsinAIVision/UniversalFakeDetect
gendetection/UnbiasedGenImage
Corresponding Code to the Paper "Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets"
opendatalab/OHR-Bench
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
opendatalab/OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
owenzlz/PAL4VST
Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV 23')
Michel-liu/FatFormer
[CVPR 2024] The official repo for Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
sfimediafutures/CLIPping-the-Deception
Code and pre-trained models for our paper "CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection".
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
visioncortex/vtracer
Raster to Vector Graphics Converter
opendatalab/skydiffusion
The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
Dinghow/UIM
The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
yogendra-yatnalkar/SAM-Promptless-Task-Specific-Finetuning
Promtless-TaskSpecific-Finetuning of MetaAI Segment-Anything Model
beccabai/multi-agent-data-selection
This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.
opendatalab/LOKI
The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
yun-liu/FastSaliency
Code for "SAMNet: Stereoscopically Attentive Multi-scale Network for Lightweight Salient Object Detection" and "Lightweight Salient Object Detection via Hierarchical Visual Perception Learning"