original dimensions
Closed this issue · 0 comments
opyate commented
Hello, I asked this same question on HuggingFace a couple of months ago, but haven't had a reply yet.
Here are the features:
{'image_id': Value(dtype='int64', id=None),
'image': Image(decode=True, id=None),
'width': Value(dtype='int32', id=None),
'height': Value(dtype='int32', id=None),
'doc_category': Value(dtype='string', id=None),
'collection': Value(dtype='string', id=None),
'doc_name': Value(dtype='string', id=None),
'page_no': Value(dtype='int64', id=None),
'objects': [{'category_id': ClassLabel(names=['Caption', 'Footnote', 'Formula', 'List-item', 'Page-footer', 'Page-header', 'Picture', 'Section-header', 'Table', 'Text', 'Title'], id=None),
'image_id': Value(dtype='string', id=None),
'id': Value(dtype='int64', id=None),
'area': Value(dtype='int64', id=None),
'bbox': Sequence(feature=Value(dtype='float32', id=None), length=4, id=None),
'segmentation': [[Value(dtype='float32', id=None)]],
'iscrowd': Value(dtype='bool', id=None),
'precedence': Value(dtype='int32', id=None)}]}
Width and height are always 1025, which seems to correspond to the original dataset's "coco_width" and "coco_height".
However, I'm looking for the PDF pages' original widths and heights. Where is that included in this dataset? I'm aware it's in the auxiliary JSON files, but it would be more convenient to have it all in one place.
Closing, as it's now answered on HF: https://huggingface.co/datasets/ds4sd/DocLayNet/discussions/4