document-content-extraction

There are 1 repositories under document-content-extraction topic.

  • ispras/dedoc

    Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

    Language:Python595133044