Pinned Repositories
community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
pipeline-paddleocr
Pipeline for converting PDFs to raw text with PaddleOCR
pipeline-sec-filings
Preprocessing pipeline notebooks and API supporting text extraction from SEC documents
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
unstructured-api
unstructured-api-tools
unstructured-inference
unstructured-js-client
A Typescript client for the Unstructured hosted API
unstructured-python-client
A Python client for the Unstructured hosted API
unstructured.PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Unstructured's Repositories
Unstructured doesn’t have any repository yet.