poor text handling

Question

pseudotensor opened this issue a year ago · 1 comments

I have searched the Github Discussion and issues and have not found anything similar to this.

ubuntu 20
python 3.10
torch 2.1.2
cuda 12.1

No repetition and description of image's text

prompt:

Describe the image in detail.

Tried many other prompts, same kinds of failures.

No response

Answer 1 · 2024-02-04T07:20:08.000Z

The current version of the model does not have enhancements for OCR capabilities at the moment.