The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
Primary LanguagePythonApache License 2.0Apache-2.0