MTSamplesExtractor

A simple Python script that automatically extracts the medical text from every page of MTSamples and writes it to text files.

Primary language: Python

To run the script, enter the following on the command line:

python web_extract.py <text_folder_name>

where <text_folder_name> is the folder in which the text files will be written.
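
As a rough illustration of how a script like this might handle the folder argument and write its output, here is a minimal sketch. It is not the actual contents of web_extract.py: the names `write_samples` and `main`, the file naming scheme, and the placeholder sample text are all assumptions made for the example, and the step that downloads pages from MTSamples is left out.

```python
import sys
from pathlib import Path


def write_samples(folder_name, samples):
    """Write each (name, text) pair to <folder_name>/<name>.txt and return the paths.

    Hypothetical helper: the real script may name and organize files differently.
    """
    folder = Path(folder_name)
    folder.mkdir(parents=True, exist_ok=True)  # create the output folder if it is missing
    written = []
    for name, text in samples:
        path = folder / f"{name}.txt"
        path.write_text(text, encoding="utf-8")
        written.append(path)
    return written


def main(argv):
    """Expect exactly one argument after the script name: the output folder."""
    if len(argv) != 2:
        print("usage: python web_extract.py <text_folder_name>")
        return 1
    # Placeholder data (assumption): the real script would obtain the text
    # from MTSamples pages instead.
    samples = [("sample_1", "Placeholder medical text.")]
    write_samples(argv[1], samples)
    return 0
```

Calling `main(sys.argv)` from a script entry point would reproduce the command-line behaviour described above: one folder argument, with one text file written per extracted sample.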