zhiyuanyou

Ph.D. candidate in MMLab, CUHK

The Chinese University of Hong Kong, Hong Kong

zhiyuanyou's Stars

Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python 39.6k 448 3155.1k
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool for converting PDF to Markdown and JSON.
Language: Python 22k 115 7671.6k
ShiArthur03/ShiArthur03
Language: MATLAB 10.3k 32 1.4k1.9k
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python 9.9k 127 478938
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language: Python 6.2k 44 148397
richzhang/PerceptualSimilarity
LPIPS metric. pip install lpips
Language: Python 3.7k 53 109502
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language: Python 2.2k 29 173148
dvlab-research/ControlNeXt
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language: Python 1.5k 26 7771
lucidrains/transfusion-pytorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
Language: Python 836 35 2634
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
Language: Python 588 15 5032
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
Language: Jupyter Notebook 377 5 3126
Q-Future/Q-Align
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
Language: Python 319 2 3922
yuweihao/MM-Vet
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
Language: Python 275 2 711
zhangzjn/ADer
ADer (https://arxiv.org/abs/2406.03262) is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.
Language: Python 208 5 5112
zwx8981/LIQE
[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
Language: Python 201 1 3111
OpenImagingLab/HDRFlow
[CVPR 2024] Real-Time HDR Video Reconstruction
Language: Python 130 5 66
google-research-datasets/richhf-18k
The RichHF-18K dataset contains the rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file names of the associated labeled images (no URLs or images are included in this dataset).
111 6 123
kamwoh/partcraft
[ECCV2024] PartCraft: Crafting Creative Objects by Parts
Language: Python 85 4 11
HaomingCai/PIPAL-dataset
[ Official ] - PIPAL Dataset and Training Codebase. ECCV-2020, NTIRE-21/22.
Language: Python 70 3 113
lcysyzxdxc/AGIQA-3k-Database
[IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment
52 2 42
OpenImagingLab/emm
[ECCV2024] Event-Based Motion Magnification
Language: Python 50 3 13
lingli1996/GeoReasoner
[ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model
Language: Python 34 7 31
SUPER-TADORY/1p-frac
Language: Python 29 1 20
OpenImagingLab/DualDn
[ECCV2024] DualDn: Dual-domain Denoising via Differentiable ISP. SOTA model for real-image denoising, in terms of both denoising performance and generalization ability.
Language: Python 28 4 61
Kaiwen-Zhu/AgenticIR
An Intelligent Agentic System for Complex Image Restoration Problems
Language: Python 25 1 00
OpenImagingLab/LenslessFace
LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification
Language: Python 18 1 01
huTao1030/DiffHDR-pytorch
This is the official PyTorch implementation for DiffHDR: Towards High-quality HDR Deghosting with Conditional Diffusion Models (TCSVT'2023)
Language: Python 9 2 21
OpenImagingLab/PhoCoLens
PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging
8 2 10
PhoCoLens/PhoCoLens.github.io
Language: JavaScript 20
zhiyuanyou/DepictQA
DepictQA: Depicted Image Quality Assessment with Vision Language Models
Language: Python 1 0 00
