extractcitations

Use this handy script to extract citations from your PDF files

In order to run the test: Do these only the FIRST time:

Make sure you have Python 2x installed
Make sure you have the "refextract" library installed (install by opening terminal and typing: "pip install pyextract" NO QUOTES)

Do these every time you need to extract citations from a PDF document:

Add a selection of PDFs to the folder "PDFS_to_PROCESS" located in the root of this project
Open your TERMINAL from the ROOT/MAIN folder of this project
In the TERMINAL write: python parse.py
Wait for the results
Copy the contents of "drop_these_into_anystyledotio.txt"
In your browser visit "https://anystyle.io/"
Paste what you copied in step 6 into the box labelled, "Paste your references here"
Click parse
Once your job is completed scroll down and select from: BibText, CiteProc/JSON, XML DONE!

noveoko/extractcitations