demonzyj56

Ph.D. in computer vision.

Nanyang Technological University

demonzyj56's Stars

HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language: JavaScript · 17.3k stars · 2.1k forks
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language: Python · 1.4k stars · 89 forks
deepghs/imgutils
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
Language: Python · 1368
ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Language: Python · 25.8k stars · 5.1k forks
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance
Language: Python · 3.9k stars · 301 forks
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Python · 2.7k stars · 197 forks
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Language: Python · 44822
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python · 1.1k stars · 73 forks
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language: Python · 2.6k stars · 236 forks
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language: Jupyter Notebook · 2.2k stars · 203 forks
lyst/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
Language: Python · 4.7k stars · 679 forks
jinyeying/DC-ShadowNet-Hard-and-Soft-Shadow-Removal
[ICCV 2021] "DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network", https://arxiv.org/abs/2207.10434
Language: Python · 20419
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language: Java · 29.7k stars · 2.2k forks
Benjamin-Loison/YouTube-operational-API
YouTube operational API works when YouTube Data API v3 fails.
Language: PHP · 34441
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language: Python · 76.3k stars · 6k forks
tencentyun/cos-python-sdk-v5
Language: Python · 183126
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python · 34.5k stars · 5.3k forks
kanosawa/anime_face_landmark_detection
Anime face landmark detection by deep cascaded regression
Language: Python · 22617
abhishekkrthakur/approachingalmost
Approaching (Almost) Any Machine Learning Problem
6.8k stars · 1k forks
zengyh1900/Awesome-Image-Inpainting
A curated list of image inpainting and video inpainting papers and resources
Language: Python · 1.8k stars · 248 forks
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Language: Jupyter Notebook · 7.5k stars · 812 forks
naoto0804/pytorch-inpainting-with-partial-conv
Unofficial PyTorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]
Language: Python · 577135
chat2db/Chat2DB
🔥🔥🔥 AI-driven data management platform. Over 1 million developers are using Chat2DB.
Language: Java · 14.2k stars · 1.6k forks
bcmi/Awesome-Visible-Watermark-Removal
9910
tanimutomo/partialconv
Re-Implementation of "Image Inpainting for Irregular Holes using Partial Convolution"
Language: Python · 6716
facebookresearch/ssl_watermarking
Official implementation of "Watermarking Images in Self-Supervised Latent-Spaces"
Language: Python · 8410
benfred/implicit
Fast Python Collaborative Filtering for Implicit Feedback Datasets
Language: Python · 3.5k stars · 607 forks
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python · 25.1k stars · 2.9k forks
walles/px
ps, top and pstree for human beings
Language: Python · 2349
Yangzhangcst/Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
1k stars · 124 forks
