hengrui0516

SJTU student

SJTUShanghai, China

hengrui0516's Stars

mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Language:Python80138
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Language:Jupyter Notebook42414
zyang-ur/idea2img
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024
Language:Python192
opendatalab/UrBench
Language:JavaScript1
Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
Language:Python50554
WisconsinAIVision/UniversalFakeDetect
Language:Python23428
gendetection/UnbiasedGenImage
Corresponding Code to the Paper "Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets"
Language:Python16
opendatalab/OHR-Bench
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
Language:Python4910
opendatalab/OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
Language:Python16716
owenzlz/PAL4VST
Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV 23')
Language:Python514
Michel-liu/FatFormer
[CVPR 2024] The official repo for Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
Language:Python723
sfimediafutures/CLIPping-the-Deception
Code and pre-trained models for our paper "CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection".
Language:Python537
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Language:Python23521
visioncortex/vtracer
Raster to Vector Graphics Converter
Language:Rust3.6k246
opendatalab/skydiffusion
The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
Language:Python402
Dinghow/UIM
The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting
Language:Python24
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Language:Python67750
yogendra-yatnalkar/SAM-Promptless-Task-Specific-Finetuning
Promtless-TaskSpecific-Finetuning of MetaAI Segment-Anything Model
Language:Jupyter Notebook6
beccabai/multi-agent-data-selection
This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.
35
opendatalab/LOKI
The official implementation of the paper “LOKI：A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
Language:Python1211
yun-liu/FastSaliency
Code for "SAMNet: Stereoscopically Attentive Multi-scale Network for Lightweight Salient Object Detection" and "Lightweight Salient Object Detection via Hierarchical Visual Perception Learning"
Language:Python5417

hengrui0516

hengrui0516's Stars

mbzuai-oryx/groundingLMM

tgxs002/HPSv2

zyang-ur/idea2img

opendatalab/UrBench

Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization

WisconsinAIVision/UniversalFakeDetect

gendetection/UnbiasedGenImage

opendatalab/OHR-Bench

opendatalab/OmniDocBench

owenzlz/PAL4VST

Michel-liu/FatFormer

sfimediafutures/CLIPping-the-Deception

opendatalab/UniMERNet

visioncortex/vtracer

opendatalab/skydiffusion

Dinghow/UIM

opendatalab/DocLayout-YOLO

yogendra-yatnalkar/SAM-Promptless-Task-Specific-Finetuning

beccabai/multi-agent-data-selection

opendatalab/LOKI

yun-liu/FastSaliency