hugheylab/pmparser

Missing titles in the article table

Closed this issue · 2 comments

Hello again,

I noticed that several records for the publications don't contain the title in the article table.
While there are no empty fields, there are 29,172 empty strings (august version) despite the title being present in PubMed for the given PMIDs:

image

The same can also be observed via Google Big Query:
image

Is this a parsing error from the XML files?

It's not a parsing error. Those articles actually don't have a title. They're not written in English, though, so they have a vernacular title, which must be what PubMed uses. I've revised pmparser to parse the Language and VernacularTitle fields dd71807, and these will be in next month's version of PMDB.

Okay, thanks for the explanation and the upcoming update!