jaycheney's Stars
KindXiaoming/pykan
Kolmogorov Arnold Networks
linyqh/NarratoAI
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
zuruoke/watermark-removal
a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
amusi/ECCV2024-Papers-with-Code
ECCV 2024 论文和开源项目合集,同时欢迎各位大佬提交issue,分享ECCV 2024论文和开源项目
InterDigitalInc/CompressAI
A PyTorch library and evaluation platform for end-to-end compression research
Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
chaofengc/Awesome-Image-Quality-Assessment
A comprehensive collection of IQA papers
Kobaayyy/Awesome-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision
A Collection of Papers and Codes for CVPR2024/CVPR2021/CVPR2020 Low Level Vision
braindotai/Watermark-Removal-Pytorch
🔥 CNN for Watermark Removal using Deep Image Prior with Pytorch 🔥.
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
extreme-assistant/Awesome-CV-Team
国内外优秀的计算机视觉团队汇总,极市团队整理
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
Yangzhangcst/Mamba-in-CV
A paper list of some recent Mamba-based CV works.
bcmi/SLBR-Visible-Watermark-Removal
[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement
grip-unina/DMimageDetection
On the detection of synthetic images generated by diffusion models
zwx8981/LIQE
[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
wusize/CLIPSelf
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
ZhanYang-nwpu/Awesome-Remote-Sensing-Multimodal-Large-Language-Model
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
lzw-lzw/awesome-remote-sensing-vision-language-models
Awesome-Remote-Sensing-Vision-Language-Models
lcy0604/EraseNet
NJU-LHRS/LHRS-Bot
VGI-Enhanced multimodal large language model for remote sensing images.
lzhbrian/metrics
IS, FID score Pytorch and TF implementation, TF implementation is a wrapper of the official ones.
lzw-lzw/RemoteGLM
用于遥感图像场景分析的中文多模态大模型 | Chinese multimodal large-scale model for remote sensing image scene analysis
winycg/CLIP-KD
[CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation
HighwayWu/LASTED
Synthetic Image Detection
XiPotatonium/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
CLUEbenchmark/SuperCLUE-Image
中文原生文生图测评基准