surelyee's Stars
clovaai/bros
tstanislawek/awesome-document-understanding
A curated list of resources for Document Understanding (DU) topic
jpWang/LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
TrevorIkky/RegNet
Implementation of RegNet which builds on ResNet by adding an RNN-Regulator Module to preserve the spatio-temporal data between residual blocks.
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
ju-chen/Efficient-Prompt
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
DirtyHarryLYL/Transformer-in-Vision
Recent Transformer-based CV and related works.
DS3Lab/TableParser
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
abhijithneilabraham/tableQA
AI Tool for querying natural language on tabular data.
clovaai/spade
hikopensource/DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
sotowang/zmyy-seckill
知苗易约抢购
whai362/PVT
Official implementation of PVT series
dongzhang89/FPT
Implementation for paper: Feature Pyramid Transformer
wenwenyu/PICK-pytorch
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
chineseocr/chineseocr
yolo3+ocr
BookerDeWitt/MSBDN-DFF
The source code of CVPR 2020 paper "Multi-Scale Boosted Dehazing Network with Dense Feature Fusion"
amusi/Deep-Learning-Interview-Book
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
bobwan1995/PMFNet
Implementation of "Pose-aware Multi-level Feature Network for Human Object Interaction Detection"(ICCV 2019 Oral)
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
NVlabs/planercnn
PlaneRCNN detects and reconstructs piece-wise planar surfaces from a single RGB image
mehtadushy/SelecSLS-Pytorch
Reference ImageNet implementation of SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On implicit filter level sparsity in Convolutional Neural Networks".
mks0601/3DMPPE_POSENET_RELEASE
Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019
mks0601/3DMPPE_ROOTNET_RELEASE
Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019
mks0601/PoseFix_RELEASE
Official TensorFlow implementation of "PoseFix: Model-agnostic General Human Pose Refinement Network", CVPR 2019
surelyee/Spring-Boot-MVC
敏捷web作业
Jeff-sjtu/CrowdPose
CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark, CVPR 2019, Oral
NieXC/pytorch-ppn
Pytorch implementation of Pose Partition Networks for Multi-Person Pose Estimation (ECCV'18)