@[TOC]
A curated list of resources dedicated to Face Recognition & Detection, OCR, Object Detection, GAN, 3D, Motion Tracking & Pose Estimation, ReID, NAS, Recommendation, and Model Scaling. Suggestions and pull requests are welcome.
- ReID
- GAN
- NAS
- SLAM
- Classification
- Recommendation & CTR
- Video Processing
- Building and Training
- Body Related
- Data Processing
- Object Detection & Semantic
- Model Compression and Acceleration
- Motion & Pose
- Text Detection & Recognition
- [2019-CVPR] Bag of Tricks and A Strong Baseline for Deep Person Re-identification(Baseline)
paper
code
paper
- [2019-CVPR] Backbone Can Not be Trained at Once: Rolling Back to Pre-trained Network for Person Re-Identification
paper
code
- [2019-CVPR] DBC: Dispersion based Clustering for Unsupervised Person Re-identification
paper
code
- [2019-CVPR] EANet: Enhancing Alignment for Cross-Domain Person Re-identification(***SOTA)
paper
code
- [2019-CVPR] High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection
paper
[code](https://github.com/liuwei16/CSP)
- [2019-CVPR] Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification
paper
code
- [2019-CVPR] MAR: Unsupervised Person Re-identification by Soft Multilabel Learning
paper
code
- [2019-CVPR] SSA-CNN: Semantic Self-Attention CNN for Pedestrian Detection(SOTA)
paper
- [2018-BMVC] Deep Association Learning for Unsupervised Video Person Re-identification
paper
code
- [collection] Awesome Generative Adversarial Networks with tensorflow
code
- [framework] Implementations of a number of generative models GAN, VAE, Seq2Seq, VAEGAN, GAIA, Spectrogram Inversion in Tensorflow
code
- [2019-NAACL] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
paper
code
code-pytorch
- [2019-CVPR] StyleGan: Generator Inversion for Image Enhancement and Animation
paper
code
- [2018-ICLR] Progressive Growing of GANs for Improved Quality, Stability, and Variation
paper
code
- [framework] An open source AutoML toolkit for neural architecture search and hyper-parameter tuning
code
- [2019-CVPR] AutoGrow: Automatic Layer Growing in Deep Convolutional Networks
paper
code
- [2019-arXiv] MDENAS: Multinomial Distribution Learning for Effective Neural Architecture Search
paper
code
- [2019-CVPR] MnasNet: Platform-Aware Neural Architecture Search for Mobile
paper
code
- [2019-CVPR] Searching for A Robust Neural Architecture in Four GPU Hours
paper
code
- [2019-arXiv] Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization
paper
code
- [2019-CVPR] Dynamic Distribution Pruning for Efficient Network Architecture Search
paper
code
- [ToolBox] OpenVSLAM: a Versatile Visual SLAM Framework
code
- [2019-CVPR] AdaptForStereo: Learning to Adapt for Stereo
paper
code
- [2019-arXiv] DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction
paper
code
- [2019-CVPR] Detailed Human Shape Estimation from a Single Image by Hierarchical Mesh Deformation
paper
code
- [2019-CVPR] Defusr: Learning Non-volumetric Depth Fusion using Successive Reprojections
code
- [2019-CVPR] GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
paper
code
- [2019-CVPR] MegaDepth: Learning Single-View Depth Prediction from Internet Photos
paper
- [2019-CVPR] Neural Rerendering in the Wild
paper
code
code
- [2019-CVPR] PyRobot: An Open-source Robotics Framework for Research and Benchmarking
paper
code
- [2019-CVPR] Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes(3D reconstruction)
paper
code
- [2019-CVPR] SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks
paper
- [2019-CVPR] Taking a Deeper Look at the Inverse Compositional Algorithm(image alignment)
paper
code
- [ToolBox] Sandbox for training convolutional networks for computer vision (VGG, ResNet, PreResNet, ResNeXt, SENet, ResAttNet, SKNet, PyramidNet, DenseNet, BagNet, MSDNet, FishNet, SqueezeNet, SqueezeResNet, SqueezeNext, ShuffleNet, ShuffleNetV2, MENet, MobileNet, FD-MobileNet, MobileNetV2, MobileNetV3, Xception, InceptionV3, InceptionV4, InceptionResNetV2, PolyNet, NASNet-Mobile, PNASNet-Large, EfficientNet)
code
- [ToolBox] Classification models trained on ImageNet
code
code-keras
- [2019-CVPR] RepMet: Representative-based metric learning for classification and one-shot object detection
paper
- [2018-CVPR] SENet: Squeeze-and-Excitation Networks(champion for ImageNet)
paper
code
code-caffe
- [2018-CVPR] FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction
paper
code
- [ToolBox] Implementation of Deep Learning based Recommender Algorithms with TensorFlow
code
- [ToolBox] A framework for training and evaluating AI models on a variety of openly available dialogue datasets
code
- [ToolBox] StarSpace: Embed All The Things!
paper
code
- [ToolBox] Modular and Extendible package of deep-learning based CTR models
code
- [collection] Classic papers and resources on recommendation
papers
- [collection] A collection of resources for Recommender Systems
papers
- [collection] Papers, datasets, and outline for recommendation
code
code
- [2019-arXiv] Deep Learning Recommendation Model for Personalization and Recommendation Systems(***CTR)
paper
code
- [2019-arXiv] Generative Adversarial User Model for Reinforcement Learning Based Recommendation System
paper
- [2019-arXiv] Recent Advances in Diversified Recommendation
paper
- [2017-arXiv] Training Deep AutoEncoders for Collaborative Filtering(***SOTA)
paper
code
- [2019-CVPR] Video Classification
paper
code
- [2019-CVPR] FastDVDnet: Towards Real-Time Video Denoising Without Explicit Motion Estimation(denoising)
paper
code
- [2019-CVPR] Hallucinating Optical Flow Features for Video Classification
paper
code
- [2019-CVPR] DAVANet: Stereo Deblurring with View Aggregation(deblurring)
paper
code
- [2019-CVPR] DVDnet: A Simple and Fast Network for Deep Video Denoising(***SOTA)
paper
code
- [2019-CVPR] Deep Flow-Guided Video Inpainting
paper
code
- [2019-CVPR] EDVR: Video Restoration with Enhanced Deformable Convolutional Networks
paper
code
- [2019-CVPR] TecoGAN: Temporally Coherent GANs for Video Super-Resolution
paper
code
- [2018-XXXX] A Deep Learning based project for colorizing and restoring old images and video!(***)
code
- [ToolBox] Pretrained EfficientNet, MobileNetV3 V2 and V1, MNASNet A1 and B1, FBNet, ChamNet, Single-Path NAS
code
- [2019-CVPR] Aggregation Cross-Entropy for Sequence Recognition (The ACE loss function exhibits competitive performance to CTC)
paper
code
- [2019-CVPR] KL-Loss: Bounding Box Regression with Uncertainty for Accurate Object Detection
paper
code
- [2019-CVPR] Pacnet: Pixel-Adaptive Convolutional Neural Networks(new network architecture)
paper
code
- [2019-CVPR] ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
paper
- [ToolBox] A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
code
- [2019-CVPR] mixup: Bag of Freebies for Training Object Detection Neural Networks
paper
code
- [2019-CVPR] Improving Transferability of Adversarial Examples with Input Diversity
paper
code
- [2019-CVPR] RePr: Improved Training of Convolutional Filters
paper
- [2018-CVPR] Fd-mobilenet: Improved mobilenet with a fast downsampling strategy
paper
code
- [2019-CVPR] TedEval: A Fair Evaluation Metric for Scene Text Detectors(***)
paper
code
- [2019-CVPR] Tools for evaluating and visualizing results for the Multi Object Tracking and Segmentation (MOTS)
paper
code
- [collection] A curated list of related resources for hand pose estimation
code
- [collection] Face Benchmark and Dataset
code
- [ToolBox] A face recognition solution on mobile device
code
- [2019-CVPR] Dense 3D Face Decoding over 2500FPS: Joint Texture & Shape Convolutional Mesh Decoders
paper
- [2019-CVPR] DSFD: Dual Shot Face Detector
paper
code
- [2019-CVPR] RetinaFace: Single-stage Dense Face Localisation in the Wild(***SOTA)
paper
code
- [2019-CVPR] PyramidBox++: High Performance Detector for Finding Tiny Face(***SOTA)
paper
code
- [2019-CVPR] SRN: Improved Selective Refinement Network for Face Detection(SOTA)
paper
code
- [2018-arXiv] Face Alignment: How far are we from solving the 2D & 3D Face Alignment problem
paper
code
- [2018-CVPR] Look at Boundary: A Boundary-Aware Face Alignment Algorithm
paper
code
- [2018-ECCV] Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network
code
- [2019-CVPR] A Non-Intrusive Method of Face Liveness Detection Using Specular Reflection and Local Binary Patterns(Liveness Detection)
paper
- [2019-CVPR] FeatherNets: Convolutional Neural Networks as Light as Feather for Face Anti-spoofing(***Anti-spoofing)
paper
code
- [2019-CVPR] Liveness Detection Using Implicit 3D Features
paper
- [2019-CVPR] Disentangled Representation Learning for 3D Face Shape(3D face)
paper
code
- [2019-CVPR] Expressive Body Capture: 3D Hands, Face, and Body From a Single Image
paper
code
- [2019-CVPR] Learning to Regress 3D Face Shape and Expression From an Image Without 3D Supervision
paper
code
- [2019-CVPR] Monocular Total Capture: Posing Face, Body and Hands in the Wild
paper
code
- [2019-CVPR] MVF-Net: Multi-View 3D Face Morphable Model Regression(face reconstructing)
code
- [2019-CVPR] AdaFM: Modulating Image Restoration with Continual Levels via Adaptive Feature Modification Layers(denoising)
paper
code
- [2019-arXiv] AWSRN: Lightweight Image Super-Resolution with Adaptive Weighted Learning Network
paper
code
- [2019-CVPR] Deep Learning for Image Super-resolution: A Survey
paper
- [2019-CVPR] DPSR: Deep Plug-and-Play Super-Resolution for Arbitrary Blur Kernels
paper
code
- [2019-CVPR] Meta-SR: A Magnification-Arbitrary Network for Super-Resolution
paper
code
- [2019-arXiv] PASSRnet: Learning Parallax Attention for Stereo Image Super-Resolution
paper
code
- [2019-CVPR] SRNTT: Image Super-Resolution by Neural Texture Transfer
paper
code
- [2019-CVPR] Towards Real Scene Super-Resolution with Raw Images
paper
- [2018-CVPR] RCAN: Image Super-Resolution Using Very Deep Residual Channel Attention Networks
paper
code
- [collection] Awesome Generative Adversarial Networks with tensorflow
code
- [framework] Implementations of a number of generative models GAN, VAE, Seq2Seq, VAEGAN, GAIA, Spectrogram Inversion in Tensorflow
code
- [2019-CVPR] DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis
paper
github.com/NVlabs/SPADE
- [2019-CVPR oral] GauGAN: Semantic Image Synthesis with Spatially-Adaptive Normalization
paper
code
- [2019-CVPR] MSGAN: Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis
paper
code
- [2019-arXiv] MSG-GAN: Multi-Scale Gradients GAN for more stable and synchronized multi-scale image synthesis
paper
code
- [2019-arXiv] Self-Attention Generative Adversarial Networks
paper
code
- [2019-CVPR] Shapes and Context: In-the-wild Image Synthesis & Manipulation(Image Synthesis)
code
code
- [2019-CVPR] STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
paper
code
- [2018-CVPR] High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
paper
code
- [2019-CVPR] Image-to-Image Translation via Group-wise Deep Whitening-and-Coloring Transformation
paper
code
- [2018-CVPR] CycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
paper
- [2018-CVPR] Pix2pix: Image-to-Image Translation with Conditional Adversarial Networks
paper
code
- [2019-CVPR] A Preliminary Study on Data Augmentation of Deep Learning for Image Classification
paper
- [2019-CVPR] Further advantages of data augmentation on convolutional neural networks
paper
- [2019-CVPR] Learning Data Augmentation Strategies for Object Detection
paper
- [2019-CVPR] PSIS: Data Augmentation for Object Detection via Progressive and Selective Instance-Switching
paper
code
- [2019-CVPR] Wide-Context Semantic Image Extrapolation(expand image)
paper
code
- [ToolBox] A Simple and Versatile Framework for Object Detection and Instance Recognition
code
- [ToolBox] Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch
code
- [ToolBox] Object detection with yolov2, tiny yolov3, mobilenet, mobilenetv2, shufflenet(g2), shufflenetv2(1x), squeezenext(1.0-SqNxt-23v5), light xception, xception
code
- [ToolBox] MMDetection: Open MMLab Detection Toolbox and Benchmark
paper
code
- [ToolBox] Semantic Segmentation on PyTorch (includes FCN, PSPNet, Deeplabv3, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet)
code
code
- [ToolBox] Segmentation models with pretrained backbones
code
- [2019-CVPR] Activity Driven Weakly Supervised Object Detection
code
- [2019-CVPR] CenterNet: Objects as Points
paper
(***)code
- [2019-CVPR] Cascade R-CNN: High Quality Object Detection and Instance Segmentation(***SOTA)
paper
code
code-Caffe
- [2019-CVPR] CornerNet-Lite: Efficient Keypoint Based Object Detection(SOTA)
paper
code
- [2019-CVPR] DFPN: Efficient Object Detection Model for Real-Time UAV Applications
paper
code
code-Caffe
- [2019-CVPR] Distilling Object Detectors with Fine-grained Feature Imitation
code
- [2019-CVPR] ExtremeNet: Bottom-up Object Detection by Grouping Extreme and Center Points(***)
paper
code
- [2019-CVPR] FSAF: Feature Selective Anchor-Free Module for Single-Shot Object Detection(SOTA)
paper
- [2019-CVPR] FoveaBox: Beyond Anchor-based Object Detector(SOTA)
paper
- [2019-CVPR] FCOS: Fully Convolutional One-Stage Object Detection(***)
paper
code
- [2019-CVPR] Grid R-CNN Plus: Faster and Better
paper
code
- [2019-CVPR] Hybrid Task Cascade for Instance Segmentation
paper
code
- [2019-CVPR] Locating Objects Without Bounding Boxes(***crowd count)
paper
code
- [2019-CVPR] Learning Data Augmentation Strategies for Object Detection
paper
code
- [2019-CVPR] LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking
paper
code
- [2019-CVPR] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud(***SOTA)
paper
code
code-unofficial
- [2019-CVPR] TridentNet: Scale-Aware Trident Networks for Object Detection(***SOTA)
paper
code
- [2019-CVPR] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
paper
code
- [2019-CVPR] Region Proposal by Guided Anchoring
paper
code
- [2019-CVPR] SNIPER: Efficient Multi-Scale Training
paper
code
- [2019-CVPR] SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection(fast and low power)
paper
- [2019-CVPR] ScratchDet: Training Single-Shot Object Detectors from Scratch
paper
code
- [2019-CVPR] YOLOv3+: Assisted Excitation of Activations: A Learning Technique to Improve Object Detectors
paper
code
- [2018-ECCV] Acquisition of Localization Confidence for Accurate Object Detection
paper
code
- [Survey] Salient Object Detection: A Survey
paper
- [2019-CVPR] A Mutual Learning Method for Salient Object Detection with intertwined Multi-Supervision
code
- [2019-CVPR] AFNet: Attentive Feedback Network for Boundary-aware Salient Object Detection
code
- [2019-CVPR] A Simple Pooling-Based Design for Real-Time Salient Object Detection
code
- [2019-CVPR] BASNet: Boundary-Aware Salient Object Detection
paper
code
- [2019-CVPR] Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection
paper
code
- [2019-CVPR] CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
paper
code
- [2019-CVPR] Cascaded Partial Decoder for Fast and Accurate Salient Object Detection(***)
code
- [2019-CVPR] LFNet: Light Field Saliency Detection with Deep Convolutional Networks
paper
code
- [2019-CVPR] Pyramid Feature Attention Network for Saliency detection(***)
paper
code
- [2019-CVPR] Shifting More Attention to Video Salient Object Detection
paper
code
- [2019-CVPR oral] CLAN: Category-level Adversaries for Semantics Consistent Domain Adaptation
paper
code
- [2019-CVPR] BRS: Interactive Image Segmentation via Backpropagating Refinement Scheme(***)
paper
code
- [2019-CVPR] DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation(used in camera)
paper
code
- [2019-CVPR] DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency
paper
code
- [2019-CVPR] Domain Adaptation(reducing the domain shift)
paper
- [2019-CVPR] ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation
paper
code
- [2019-CVPR oral] GLNet: Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images
paper
code
- [2019-CVPR] Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth(***SOTA)
paper
code
- [2018-ECCV] ICNet: Real-Time Semantic Segmentation on High-Resolution Images
paper
code
- [2019-CVPR] LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation(***SOTA)
paper
code
- [2019-arXiv] LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation
paper
code
- [2019-CVPR] PTSNet: A Cascaded Network for Video Object Segmentation
paper
code
- [2019-CVPR] PPGNet: Learning Point-Pair Graph for Line Segment Detection
paper
code
- [2019-CVPR] Show, Match and Segment: Joint Learning of Semantic Matching and Object Co-segmentation
paper
code
- [2019-CVPR] Video Instance Segmentation
paper
code
- [2018-ECCV] BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation
paper
code
- [collection] Collection of recent methods on DNN compression and acceleration
https://github.com/MingSun-Tse/EfficientDNNs
- [collection] A curated list of neural network pruning resources
https://github.com/he-y/Awesome-Pruning
- [collection] Model compression and acceleration research papers
https://github.com/cedrickchee/awesome-ml-model-compression
- [ToolBox] Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research
code
- [2019-CVPR] An Improved Trade-off Between Accuracy and Complexity with Progressive Gradient Pruning(Prune)
paper
code
- [2019-ICML] EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
paper
code
code
- [2019-CVPR] FPGM: Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
paper
code
- [2019-CVPR] Importance Estimation for Neural Network Pruning
code
- [2019-CVPR] SKNet: Selective Kernel Networks
paper
code
- [2018-CVPR] SENet: Squeeze-and-Excitation Networks
paper
code
- [2019-CVPR] ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
paper
- [2019-CVPR] AlphaPose: Real-Time and Accurate Multi-Person Pose Estimation&Tracking System
paper
code
- [2019-CVPR] CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark
paper
code
- [2019-CVPR] Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields(Oral)
paper
code
- [2019-CVPR] EpipolarPose: Self-Supervised Learning of 3D Human Pose using Multi-view Geometry
paper
code
- [2019-CVPR] Exploiting Temporal Context for 3D Human Pose Estimation in the Wild
paper
code
- [2019-CVPR] Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network(SOTA)
paper
code
- [2019-CVPR] Fast Human Pose Estimation(pytorch)
paper
code
- [2019-CVPR] High-Resolution Representation Learning for Human Pose Estimation(SOTA)
paper
code
- [2019-CVPR] Hand Shape and Pose Estimation from a Single RGB Image
paper
code
- [2019-CVPR] In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations
paper
- [2019-CVPR] VideoPose3D: 3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
code
- [2019-CVPR] XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera
paper
- [2019-CVPR] ATOM: Accurate Tracking by Overlap Maximization(***SOTA)
paper
code
- [2019-IEEE] FANTrack: 3D Multi-Object Tracking with Feature Association Network
paper
code
- [2019-CVPR] Joint Monocular 3D Vehicle Detection and Tracking(***)
paper
code
- [2019-CVPR] Leveraging Shape Completion for 3D Siamese Tracking
paper
code
- [2019-CVPR Oral] Graph Convolutional Tracking(SOTA)
code
- [2019-arXiv] Instance-Aware Representation Learning and Association for Online Multi-Person Tracking
paper
- [2019-Github] multi-people tracking (centerNet based person detector + deep sort algorithm with pytorch)(SOTA)
code
- [2019-CVPR] PoseFix: Model-agnostic General Human Pose Refinement Network
paper
code
- [2019-CVPR Oral] Progressive Pose Attention Transfer for Person Image Generation
paper
code
- [2019-CVPR] PifPaf: Composite Fields for Human Pose Estimation
paper
code
code
- [2019-CVPR] SemGCN: Semantic Graph Convolutional Networks for 3D Human Pose Regression
paper
code
- [2019-CVPR] MVPOSE: Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views(multi-person)
paper
code
- [2019-CVPR] SiamMask: Fast Online Object Tracking and Segmentation: A Unifying Approach(***SOTA)
paper
code
- [2019-CVPR] SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks(***SOTA)
paper
code
- [2018-CVPR] OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation(***)
code
- [2019-CVPR] Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation
paper
- [2019-CVPR] A Multitask Network for Localization and Recognition of Text in Images(end-to-end)
paper
- [2019-CVPR] AFDM: Handwriting Recognition in Low-resource Scripts using Adversarial Learning(data augmentation)
paper
code
- [2019-CVPR] CRAFT: Character Region Awareness for Text Detection
paper
code
- [2019-CVPR] Data Extraction from Charts via Single Deep Neural Network(*)
paper
- [2019-CVPR] E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
paper
- [2019-arXiv] FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition
paper
- [2019-CVPR] Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
paper
- [2019-CVPR] PSENET: Shape Robust Text Detection with Progressive Scale Expansion Network
paper
- [2019-CVPR] PMTD: Pyramid Mask Text Detector
paper
code
- [2019-CVPR] Spatial Fusion GAN for Image Synthesis (word synthesis)
[paper](https://arxiv.org/abs/1812.05840)
code
- [2019-CVPR] Scene Text Detection with Supervised Pyramid Context Network
paper
- [2019-arXiv] TextField: Learning A Deep Direction Field for Irregular Scene Text Detection
paper
code
- [2019-CVPR] Typography with Decor: Intelligent Text Style Transfer
paper
code
- [2019-CVPR] TIOU: Tightness-aware Evaluation Protocol for Scene Text Detection(new evaluation tool)
paper
code
- [2019-arXiv] MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
paper
code
- [2019-CVPR] Scene Text Magnifier
paper
- [2018-CVPR] Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks
paper
- [2018-ECCV] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
paper
code
- [2018-AAAI] PixelLink: Detecting Scene Text via Instance Segmentation
paper
code
- [2018-CVPR] RRPN: Arbitrary-Oriented Scene Text Detection via Rotation Proposals
paper
code