Not retrieving the correct files from the ncbi FTP
Closed this issue · 0 comments
rafspiny commented
The regular expression used in MEDOC.py is not capturing the files on the FTP.
It should be something like this:
self.regex_gz = re.compile('^pubmed.*.xml.gz$')
This is preventing the code from downloading any file from the FTP.
A solution is already available in the pull request I created.