Pinned Repositories
AON
TensorFlow implementation of the CVPR 2018 text recognition paper "AON: Towards Arbitrarily-Oriented Text Recognition"
atr
Attentional Text Recognizer, based on ASTER with improvements for practical use.
catvision
A large multimodal model that performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly outperforms the open-source Qwen-VL-7B-Chat.
ComfyUI_Easy_Nodes_hui
ctc_pro
CNN + Non-local + CTC for text recognition
ocr_tools
Given the four points of a polygon, crop the closest bounding rectangle from the original image.
papers_in_cv
A list of papers in computer vision.
SynthLogo
Synthesising Context Logo Images
text_deblur
Reducing blur in text images with convolutional neural networks
AutoSTR
H. Zhang, Q. Yao, M. Yang, Y. Xu, X. Bai. AutoSTR: Efficient Backbone Search for Scene Text Recognition. European Conference on Computer Vision (ECCV). 2020.
huizhang0110's Repositories
huizhang0110/catvision
A large multimodal model that performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly outperforms the open-source Qwen-VL-7B-Chat.
huizhang0110/papers_in_cv
A list of papers in computer vision.
huizhang0110/atr
Attentional Text Recognizer, based on ASTER with improvements for practical use.
huizhang0110/4p-hackason-3D-camera
Dynamic special-effects generation powered by a 3D camera
huizhang0110/ComfyUI_Easy_Nodes_hui
huizhang0110/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
huizhang0110/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
huizhang0110/Awesome-Crowd-Counting
Awesome Crowd Counting
huizhang0110/BossNAS
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
huizhang0110/const_layout
Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)
huizhang0110/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
huizhang0110/DivideMix
Code for paper: DivideMix: Learning with Noisy Labels as Semi-supervised Learning
huizhang0110/ELR
Official Implementation of Early-Learning Regularization Prevents Memorization of Noisy Labels
huizhang0110/FasterSeg
[ICLR 2020] "FasterSeg: Searching for Faster Real-time Semantic Segmentation" by Wuyang Chen, Xinyu Gong, Xianming Liu, Qian Zhang, Yuan Li, Zhangyang Wang
huizhang0110/gitignore
A collection of useful .gitignore templates
huizhang0110/GPU-Efficient-Networks
huizhang0110/HowToLiveLonger
A programmer's guide to living longer
huizhang0110/kiss
Code for the paper "KISS: Keeping it Simple for Scene Text Recognition"
huizhang0110/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
huizhang0110/MulimgViewer
MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
huizhang0110/NASP-codes
huizhang0110/pren
Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)
huizhang0110/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
huizhang0110/pytoshop
Library for reading and writing Photoshop PSD and PSB files
huizhang0110/RerankingTransformer
[ICCV 2021] Instance-level Image Retrieval using Reranking Transformers
huizhang0110/sam
SAM: Sharpness-Aware Minimization (PyTorch)
huizhang0110/Serving
A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)
huizhang0110/TensorRT_Tutorial
huizhang0110/TREFE
huizhang0110/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/