
Forked from the ScraperWiki pdftables (0.0.4) project, which was removed from GitHub.

Primary language: Python


A library for extracting tables from PDF documents. This project is a fork of pdftables (0.0.4), which was developed by ScraperWiki.


  • TODO (for now)
    • Make it work with the latest version of pdfminer (20140328)
    • Tidy up the code base, including PEP 8 compliance
    • Review test cases and identify a set of PDF files to use for testing
    • Add some documentation
    • Confirm that ScraperWiki is no longer interested in this project and, if so, change the name of the package for release on PyPI