hugheylab/pmparser

Missing OtherAbstract


Hello, first of all thank you for your hard work!

We noticed that for articles whose XML records include OtherAbstract instead of Abstract, such as the PMIDs listed below, PMDB does not contain the abstract text.

[12284782, 791540, 5151212, 11366488, 12179850, 12348931, 12292916, 12289996, 1618509, 6124812, 12285555, 11364237, 11648557, 7180996, 12344165, 12345055, 11365689, 12316640, 12339404, 12179666, 4633320, 843562, 12290214, 11365935, 11363801, 12280582, 11363344, 12286398, 12326859, 12280826, 28871734, 12347415, 12345672, 11362381, 12159443, 12346164, 12335641, 12278300, 12344350, 12288160, 12281616, 3147680, 8236573, 1861699, 6594525, 487866, 12279163, 12340889, 12290163, 12314362]

(The PMIDs above, along with more that have the same problem, are in otherabs.txt.)
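For reference, here is a rough sketch of how such records can be spotted in the XML. It is not how pmparser works internally; it only assumes the usual PubMed layout, where OtherAbstract sits under MedlineCitation and wraps one or more AbstractText elements. The sample record, its PMID of 0, and the function name `extract_other_abstracts` are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up record for illustration only; PMID 0 and the text are not real data.
SAMPLE_XML = """
<PubmedArticleSet>
  <PubmedArticle>
    <MedlineCitation>
      <PMID Version="1">0</PMID>
      <Article>
        <ArticleTitle>Example title</ArticleTitle>
      </Article>
      <OtherAbstract Type="PIP" Language="eng">
        <AbstractText>Example <i>abstract</i> text.</AbstractText>
      </OtherAbstract>
    </MedlineCitation>
  </PubmedArticle>
</PubmedArticleSet>
"""


def extract_other_abstracts(xml_text):
    """Return one dict per OtherAbstract element found in the XML string."""
    root = ET.fromstring(xml_text)
    rows = []
    for citation in root.iter("MedlineCitation"):
        pmid = citation.findtext("PMID")
        # The regular abstract, when present, lives under Article/Abstract.
        has_abstract = citation.find("Article/Abstract") is not None
        for other in citation.findall("OtherAbstract"):
            # itertext() keeps text nested inside inline markup such as <i>.
            text = " ".join(
                "".join(node.itertext()).strip()
                for node in other.findall("AbstractText")
            )
            rows.append(
                {
                    "pmid": pmid,
                    "type": other.get("Type"),
                    "language": other.get("Language"),
                    "has_abstract": has_abstract,
                    "text": text,
                }
            )
    return rows


rows = extract_other_abstracts(SAMPLE_XML)
for row in rows:
    print(row)
```

Rows where `has_abstract` is false are the cases described in this issue: the only abstract text in the record is in OtherAbstract.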

Interesting, thank you for raising this issue. I'll start working on a fix next week. My initial perusal of a relevant XML file suggests it should be relatively straightforward to parse this information.

Thank you very much! And thanks for replying so fast.

@alinangle The latest PR #72 should address this issue, so next month's version of PMDB will include additional tables other_abstract and other_id.

Thank you!