shuyansy
In HIT Researching in machine learning and computer vision
Harbin Institude of TechnologyChina
Pinned Repositories
Detect-and-read-meters
This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.
Efficient-Ambiguous-Text-Detector
An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene Text Detector" (ACM MM 2023)
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
MaskTextSpotter
A PyTorch implementation of Mask TextSpotter
meeting_reports
some presentations for weekly meetings
multilingual-machine-translation
This is some code for multilingual machine translation (English, Korean, Japanese, Arabic)
Pattern-Recognition-Algorithm
Re-implementation of some classical algorithm in pattern recognition
scripts-for-image-processing
some practical demos for image/ text processing
Survey-of-Visual-Text-Processing
The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"
Synthesis-multilingual-handwritten-text-data
This is a simple yet method focused on handwritten text dataset generation, which is beneficial for handwritten text detection and segmentation
shuyansy's Repositories
shuyansy/Detect-and-read-meters
This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.
shuyansy/Survey-of-Visual-Text-Processing
The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"
shuyansy/Efficient-Ambiguous-Text-Detector
An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene Text Detector" (ACM MM 2023)
shuyansy/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
shuyansy/Pattern-Recognition-Algorithm
Re-implementation of some classical algorithm in pattern recognition
shuyansy/MaskTextSpotter
A PyTorch implementation of Mask TextSpotter
shuyansy/meeting_reports
some presentations for weekly meetings
shuyansy/multilingual-machine-translation
This is some code for multilingual machine translation (English, Korean, Japanese, Arabic)
shuyansy/scripts-for-image-processing
some practical demos for image/ text processing
shuyansy/Synthesis-multilingual-handwritten-text-data
This is a simple yet method focused on handwritten text dataset generation, which is beneficial for handwritten text detection and segmentation
shuyansy/TextBorder
official implementation for paper 《BAG:Learning Border Attraction Grouping for Arbitrary-shaped Scene Text Detection》
shuyansy/TRM_tutorial
Transformer在CV和NLP领域的变体模型的从零解读:Transformer;VIT;Swin Transformer
shuyansy/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
shuyansy/cnstd
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)Python3 包
shuyansy/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
shuyansy/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
shuyansy/ICV
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
shuyansy/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
shuyansy/MovieChat
[CVPR 2024] 🎬💭 chat with over 10K frames of video!
shuyansy/OCR_DataSet
收集并整理有关OCR的数据集并统一标注格式,以便实验需要
shuyansy/OpenAI-CLIP
Simple implementation of OpenAI CLIP model in PyTorch.
shuyansy/paper_downloader
Download papers and supplemental materials from open-access paper website, such as AAAI, ACCV, AISTATS, COLT, CVPR, ECCV, ICCV, ICLR, ICML, IJCAI, JMLR, NIPS.
shuyansy/research-charnet
CharNet: Convolutional Character Networks
shuyansy/shuyansy.github.io
shuyansy/text_renderer
Generate text images for training deep learning ocr model
shuyansy/ViTAE-Transformer-Scene-Text-Detection
[IJCV2022]This is the repo for the paper "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection".