pdf-miner

There are 8 repositories under pdf-miner topic.

ktaaaki/paper2html
Converts a single/double-column PDF formatted paper into a html page, which has the original view & the paragraph view extracted from the paper for translation from the browser.
Language:Python24 1 164
luke-cha/diff-pdf
Compare PDF documents using PDF Miner and print out the differences as HTML documents
Language:HTML14 4 08
swainshashwat/Flock
Craft custom Language Model Models (LLMs) effortlessly using Flock. Build LLMs for specific domains like a pro, supported by wizardlm, bloom, falcon, and llama. Extract insights from text and images seamlessly. Powered by Python, pdfMiner, langChain, and streamLit. Unlock domain-specific intelligence with Flock! 🚀
Language:Jupyter Notebook4 2 03
department-of-veterans-affairs/DAPM-PFAS-PACT-ACT
Scrapes hazardous waste data from a website and PDF file for PACT Act. Cleans the data to prepare it for mapping.
Language:Jupyter Notebook1 2 02
plain-jane-gray/PFAS-web-and-PDF-scrape
Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.
Language:Jupyter Notebook1 1 00
TheurgicDuke771/pdf_compare
Compare PDF documents using PDF Miner and print out the differences as HTML documents
Language:Python1 0 00
MyreLab/python_filereader
Data management automation tool. PyPDF2 reads unique identifiers from files and the OS library renames the files in-place with each corresponding identifier.
Language:Jupyter Notebook0 1 00
ritikkanswal/resume-filter
This Resume Filter is used for filtering resumes according to keywords of the recruiter. It is already hosted on Heroku Check it.
Language:CSS0 1 02

pdf-miner

ktaaaki/paper2html

luke-cha/diff-pdf

swainshashwat/Flock

department-of-veterans-affairs/DAPM-PFAS-PACT-ACT

plain-jane-gray/PFAS-web-and-PDF-scrape

TheurgicDuke771/pdf_compare

MyreLab/python_filereader

ritikkanswal/resume-filter