18F/doc_processing_toolkit
Python library to extract text from PDF, and default to OCR when text extraction fails.
PythonNOASSERTION
Stargazers
- adelevie@casetext
- ajman1101Denver, CO
- arowlaStitch Fix
- bahadasxWashington, D.C.
- catherinedevlinCorning
- cloupid
- davebrazeWest Haven, CT
- derekrazo
- djeraseitTheodis Butler
- edmooneyPennsylvania
- Feng-GaoShanghai,China
- flowerinheart
- fureighFureigh Consulting
- geramirez@github
- gh0stwizardTashkent, Uzbekistan
- JaredHalpern
- Jdi99y515
- jkolbert-zz
- joewizArlington, Virginia
- jyw2116
- McCroden
- MDMollPhiladelphia, PA
- MikeTriznaSmithsonian Institution
- nateritter@Perfect-Space
- nkwoodUSA
- number0
- pkt
- pmaxwell
- reduxionist@intelligent-bytes
- rkip
- roshammar
- sivartravisCambridge, MA
- skiloopShenzhen
- unbaiatUnicorns United Ltd
- veseloskyAtlanta, GA USA
- williamnJakarta, Indonesia