LinYuOu's Stars
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
xiaobai1217/Awesome-Video-Datasets
Video datasets
NeeluMadan/ViFM_Survey
Foundation Models for Video Understanding: A Survey
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
RLGen/LakeBench
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
SuperCV/Book
:green_book: My personal book study notes and collection
KangLiao929/Awesome-Deep-Camera-Calibration
Deep Learning for Camera Calibration and Beyond: A Survey
thohemp/6DRepNet
Official PyTorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
LeeJunHyun/Image_Segmentation
PyTorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
cszn/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
ignacio-rocco/cnngeometric_pytorch
CNNGeometric PyTorch implementation
FISCO-BCOS/FISCO-BCOS
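FISCO BCOS (pronounced /ˈfɪskl bi:ˈkɒz/) is a stable, efficient, and secure permissioned blockchain platform that has been widely used in real-world industry applications. To date, it counts more than 5,000 enterprises and institutions and more than 400 benchmark industrial digitalization applications, covering cultural copyright, judicial services, government services, the Internet of Things, finance, smart communities, real estate construction, community governance, rural revitalization, and other fields.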
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
cutethotw/ClashRule
Clash traffic-routing rules for personal use
onlyliucat/Multi-chessboard-Corner-extraction-detection-
Chessboard corner extraction and chessboard recovery, following "Automatic Camera and Range Sensor Calibration using a single Shot"
eric-yyjau/pytorch-superpoint
Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
MenghaoGuo/Awesome-Vision-Attentions
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
ilguneray/Chessboard_digitizer
Chessboard digitization with neural network corner detection and yolov8 object detection.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
zhou13/lcnn
LCNN: End-to-End Wireframe Parsing
Liber-coder/CV_Notes
Key knowledge points and interview questions on 3D vision (3D reconstruction, SLAM, AR/VR), traditional image processing, and computer vision (AI-oriented).
0voice/expert_readed_books
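Latest summary for 2021: recommended reading for engineers, covering computer science, software technology, entrepreneurship, **, mathematics, and biographies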