demonzyj56's Stars
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
deepghs/imgutils
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
OpenGVLab/InternVideo
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
lyst/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
jinyeying/DC-ShadowNet-Hard-and-Soft-Shadow-Removal
[ICCV 2021] "DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network", https://arxiv.org/abs/2207.10434
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Benjamin-Loison/YouTube-operational-API
YouTube operational API works when YouTube Data API v3 fails.
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
tencentyun/cos-python-sdk-v5
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
kanosawa/anime_face_landmark_detection
Anime face landmark detection by deep cascaded regression
abhishekkrthakur/approachingalmost
Approaching (Almost) Any Machine Learning Problem
zengyh1900/Awesome-Image-Inpainting
A curated list of image inpainting and video inpainting papers and resources
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
naoto0804/pytorch-inpainting-with-partial-conv
Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]
chat2db/Chat2DB
🔥🔥🔥 AI-driven data management platform. Over 1 million developers are using Chat2DB.
bcmi/Awesome-Visible-Watermark-Removal
tanimutomo/partialconv
Re-Implementation of "Image Inpainting for Irregular Holes using Partial Convolution"
facebookresearch/ssl_watermarking
Official implementation of "Watermarking Images in Self-Supervised Latent-Spaces"
benfred/implicit
Fast Python Collaborative Filtering for Implicit Feedback Datasets
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
walles/px
ps, top and pstree for human beings
Yangzhangcst/Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.