A captcha solver for SJTU based on ViT and some preprocessing tricks. The accuracy of character classification is 99.6%.
pip install -r requirements.txt
python inference.py --image image.png --model model.pth
python main.py --path dataset
The dataset is from here
- Segment the captcha into some images with one character.
- Split the dataset into train and test sets.