Issues
Better license capture in crossref parser
#151 opened by seasidesparrow - 1
JATS parser will miss publication name if the publisher embeds it using the `<abbrev-journal-title>` tag without including `<journal-title>`
#138 opened by seasidesparrow - 2
Do not unescape < and > in XML output
#126 opened by ehenneken - 2
SPRINGER: Repeated collaboration
#114 opened by mugdhapolimera - 0
Pub month versus collection month
#122 opened by seasidesparrow - 0
Some additional email addresses from MDPI/JATS could be captured with special handling
#116 opened by seasidesparrow - 1
ELSEVIER: Maintaining roman numbers
#108 opened by mugdhapolimera - 0
DataCite parser should support detection of multiple names in one `<creatorName>` tag
#112 opened by seasidesparrow - 1
Elsevier: Translate markup in input files to standard markup for super/sub scripts
#99 opened by mugdhapolimera - 1
ELSEVIER: JHEAp not parsing
#58 opened by csgrant00 - 4
Elsevier: loss of formatting in abstract
#57 opened by csgrant00 - 1
ELSEVIER: subscripts/superscripts not parsing
#59 opened by csgrant00 - 1
Elsevier parser is unable to parse records with doctype `<ja:simple-article>`
#92 opened by seasidesparrow - 1
Name and collaboration post-processing
#95 opened by seasidesparrow - 1
IOP: suffixes not being properly rendered
#94 opened by csgrant00 - 0
Mononyms aren't being handled correctly by the utils.AuthorNames module in some cases
#88 opened by seasidesparrow - 0
Elsevier parser will fail on keyword parsing if the tag `<ce:keywords>` does not exist
#87 opened by seasidesparrow - 1
All parsers including JATS should populate the fulltext item depending on whether the publisher supplied file has a "<body>" tag or equivalent.
#79 opened by seasidesparrow - 0
IUCr record does not output title
#81 opened by mugdhapolimera - 0
Copernicus pagination missing
#76 opened by mugdhapolimera - 2
no output generated
#74 opened by csgrant00 - 2
Crossref parser needs to be able to handle records of "posted_content" type
#53 opened by seasidesparrow - 0
Abstracts in jats files with multiple embedded <p> tags are dropping all subsequent to the first
#50 opened by seasidesparrow - 0
JATS parser can fail if <aff> tag has an embedded <ext-link> without an "id" attribute
#51 opened by seasidesparrow - 1
Author affiliation data available from Crossref should be parsed into a contrib.affiliation array
#44 opened by seasidesparrow - 0
AVRO schema key names cannot contain "-".
#48 opened by tjacovich - 0
Pyproject requirement for setuptools==60.10.0 incompatible with ADSPipelineUtils
#46 opened by seasidesparrow - 1
bug: datacite parser can fail when trying to access non-existent tag attributes
#33 opened by seasidesparrow - 1
Author name parser is failing in some cases, possibly unicode-related (ccaron)
#25 opened by seasidesparrow - 0
Crossref Parser doesn't populate pagination if ids rather than pages are given
#20 opened by seasidesparrow