The following script parses all files in all subfolders of the specified main directory (e.g. Slack archive files) and extracts any URL related/pointing to a scientific article.
DISCLAIMER: Performance of the extraction is highly dependent on the provided excluder and includer keywords below
Usage is pretty straightforward, modify the input variables at the beginning of the script as you see fit and run it.
Copyright (C) 2020 NicolĂ s Palacio
Contact: nicolas.palacio@bioquant.uni-heidelberg.de
GNU-GLPv3:
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation.
This program is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
General Public License for more details.
A full copy of the GNU General Public License can be found on
http://www.gnu.org/licenses/.