clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python · MIT

Issues

Can synthdog insert text for a specified bbox?
#299 opened 2 months ago by Supercarry-khr
1
Where is the fine-tuned model?
#300 opened a month ago by sdip15fa
0
not work this app.py
#298 opened 2 months ago by sssssshf
0
How to handle multi page invoices
#279 opened 5 months ago by DNFSiF
4
Problem with getting predictions
#287 opened 4 months ago by jdrzj
6
Not getting prediction correctly using the model trained on the custom dataset (similar format as CORD-V2 dataset)
#297 opened 2 months ago by SiriusPoint
0
Does the sub task change during donut inference?
#296 opened 2 months ago by kittyLunar
0
Random prediction and wrong prediction in repeated characters
#270 opened 7 months ago by Asha-12502
4
Update donut-python Python Package to be compatible with latest versions of transformers
#295 opened 2 months ago by agovin9812
0
Classification inference
#294 opened 3 months ago by SNavgale
0
Input type (float) and bias type (struct c10::BFloat16) should be the same
#267 opened 7 months ago by Coder-Vishali
4
The provided lr scheduler `LambdaLR` doesn't follow PyTorch's LRScheduler API. You should override the `LightningModule.lr_scheduler_step` hook with your own logic if you are using a custom LR scheduler.
#255 opened 9 months ago by goseesomething
2
Official support for confidence values
#293 opened 3 months ago by HwangSeyoon
0
How to extract complete text from the document?
#292 opened 3 months ago by vikasr111
3
Complete text
#253 opened 9 months ago by VagnerBelfort
3
Multi GPU support for fine tuning
#291 opened 4 months ago by SNavgale
0
details is not ideal
#258 opened 8 months ago by chopin1998
5
VisionEncoderDecoderModel convert
#284 opened 5 months ago by sjtu-cz
2
custom json schema - ASAP
#290 opened 4 months ago by crazycoderF12
2
getting no module named lightning module when trying to run the fine tuning code in train.py file of donut model.
#280 opened 5 months ago by svocdfrockz
1
Question about the special token map
#274 opened 7 months ago by RAY-RaY-R
1
Error "A configuraton of type donut cannot be instantiated because not both `encoder` and `decoder` sub-configurations are passed" when run inference after finetuned docvqa without pushing to hugging face?
#289 opened 4 months ago by phuchm
0
Does synthdog data have an MIT or AFL-3.0 license?
#288 opened 4 months ago by becxer
1
Donut Return Output even With Blank Image
#272 opened 7 months ago by wdprsto
5
Idea: Freezing SwinEncoder and fine-tuning BARTdecoder only on custom data
#275 opened 7 months ago by jackkwok
4
DOCVQA data set format?
#281 opened 5 months ago by tzktz
3
Request: Dataset and pretrained model for language detection
#286 opened 4 months ago by turian
0
fine-tuning on docvqa, anls only 40%
#265 opened 7 months ago by ShuoZhang2003
1
Integrate a customized internal OCR engine to Donut
#285 opened 4 months ago by Altimis
1
Two types of documents in one model?
#256 opened 8 months ago by henkish
1
Bounding boxes required for pretraining?
#277 opened 6 months ago by mustaszewski
1
Prediction and Answer differ by dataset-specific tag
#282 opened 5 months ago by ftkeys
1
The latest update has the model weights twice the embedding dim size of the actual model installed through github or pip
#283 opened 5 months ago by Samartha27
2
Trying to run DOCVQA dataset
#278 opened 5 months ago by srgautam9
0
Could not find image processor class in the image processor config or the model config.
#276 opened 6 months ago by felixnguyen258
1
How to config synthdog for much more longer text, like total length about 1024-2048
#249 opened 9 months ago by CheungZeeCn
2
Simple questions answering
#271 opened 7 months ago by shersoni610
0
dataset script missing error
#250 opened 9 months ago by segaranp
1
Where can I find the dataset used for training Document Visual Question Answering model
#269 opened 7 months ago by Coder-Vishali
0
json2token performance
#266 opened 7 months ago by benjaminfh
1
Are there any acceleration solutions for donut deployment?
#264 opened 7 months ago by sjtu-cz
0
Can donut support batch inference?
#262 opened 7 months ago by sjtu-cz
2
Dataset Loader didn't work properly on Kaggle
#263 opened 8 months ago by wdprsto
1
Documentation for Synthdog?
#261 opened 8 months ago by parthch11
0
Model Consistently Mispredicting Specific Character in Invoice Number
#260 opened 8 months ago by Codedrainer
0
How to annotate and train donut for extracting all dates (unknown number of dates)
#259 opened 8 months ago by Anas-Khayata
0
Is UIPATH AI Center Document Understanding Process also OCR free like Donut
#257 opened 8 months ago by Sridhar-Ranganaboina
0
CORD-v2 accuracy much lower than the paper's results
#254 opened 9 months ago by kevinmeetooa
1
Couldn't connect to 'https://huggingface.co'
#252 opened 9 months ago by mmhzlrj
1
Handling Variable Element Presence in Parsing Document Task
#248 opened 9 months ago by abdelaziz-jaddi
0