MTSamplesExtractor
A simple Python script that automatically extracts the medical text from every page of MTSamples and writes it to text files.
To run the script, enter the following on the command line:
python web_extract.py <text_folder_name>
where <text_folder_name> is the folder to which the text files will be written.
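As a rough illustration of how the folder argument might be handled, here is a minimal sketch. It is not the actual contents of web_extract.py: the names `safe_filename`, `write_sample`, and `main` are hypothetical, the page fetching is left out, and a placeholder string stands in for the extracted text.

```python
import re
import sys
from pathlib import Path


def safe_filename(title):
    """Turn a page title into a filesystem-friendly .txt file name (hypothetical helper)."""
    cleaned = re.sub(r"[^A-Za-z0-9]+", "_", title).strip("_")
    return (cleaned or "untitled") + ".txt"


def write_sample(folder, title, text):
    """Write one extracted text into the output folder, creating the folder if needed."""
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / safe_filename(title)
    path.write_text(text, encoding="utf-8")
    return path


def main(argv):
    """Check the command-line arguments and write a placeholder sample."""
    if len(argv) != 2:
        print("usage: python web_extract.py <text_folder_name>", file=sys.stderr)
        return 1
    folder = Path(argv[1])
    # Assumption: in the real script, the title and text would come from
    # each MTSamples page; a fixed placeholder is used here instead.
    write_sample(folder, "Example Sample", "Placeholder for one extracted page.")
    return 0
```

In a script laid out like this, the entry point would call `sys.exit(main(sys.argv))`, so that a missing folder name yields a usage message and a non-zero exit code.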