Ardio (a Python package) converts academic journal articles into MP3 files that you can listen to at your leisure. It removes figures, references, and other content that cannot be read aloud. The idea is to use it as a base application for machine learning that separates the useful content from the rest.
pip install ardio
ardio input.pdf output.mp3
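
To make the filtering idea concrete, here is a minimal sketch of how text extracted from a paper might be reduced to its "listenable" parts. It is not Ardio's actual implementation: the function name `keep_listenable` and the three regular expressions are hypothetical, and the sketch assumes the article has already been converted to a list of plain-text lines.

```python
import re

# Hypothetical heuristics for illustration only; not Ardio's actual rules.
CAPTION_RE = re.compile(r"^\s*(Figure|Fig\.|Table)\s+\d+", re.IGNORECASE)
REFERENCES_RE = re.compile(r"^\s*(References|Bibliography)\s*$", re.IGNORECASE)
CITATION_RE = re.compile(r"\s*\[\d+(?:\s*[,-]\s*\d+)*\]")


def keep_listenable(lines):
    """Return the lines worth reading aloud, under the assumptions above."""
    kept = []
    for line in lines:
        if REFERENCES_RE.match(line):
            # Assume everything after this heading is the reference list.
            break
        if CAPTION_RE.match(line):
            # Skip lines that look like figure or table captions.
            continue
        # Drop bracketed numeric citation markers such as [1] or [2, 3].
        cleaned = CITATION_RE.sub("", line).strip()
        if cleaned:
            kept.append(cleaned)
    return kept


if __name__ == "__main__":
    sample = [
        "Deep nets work well [1, 2].",
        "Figure 1: Accuracy over time",
        "We conclude here.",
        "References",
        "[1] Someone. A paper.",
    ]
    for line in keep_listenable(sample):
        print(line)
```

Hand-written rules like these are brittle (a sentence that begins with "Figure 2 shows" would be dropped as a caption, for example), which is one reason a learned classifier could be a better fit for separating useful content from the rest.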