inspirehep/inspire-schemas

normalize_author_name discards name suffixes

michamos opened this issue · 2 comments

In [1]: from inspire_schemas.utils import normalize_author_name

In [2]: normalize_author_name('Smith, John Jr')
Out[2]: u'Smith, John'

The suffix is discarded. This shouldn't happen, the result should be Smith, John, Jr. (note the second comma and the dot).

Other case:
normalize_author_name('Smith, John III') == 'Smith, John, III' (no dot).

BTW, it might be good to configure nameparser http://nameparser.readthedocs.io/en/latest/customize.html#parser-customization-examples to remove stuff that is not needed for us.