normalize_author_name discards name suffixes

Question

michamos opened this issue 7 years ago · 2 comments

In [1]: from inspire_schemas.utils import normalize_author_name

In [2]: normalize_author_name('Smith, John Jr')
Out[2]: u'Smith, John'

The suffix is discarded. This shouldn't happen, the result should be Smith, John, Jr. (note the second comma and the dot).

Answer 1 · 2017-08-23T07:57:19.000Z

Other case:
normalize_author_name('Smith, John III') == 'Smith, John, III' (no dot).

Answer 2 · 2017-08-23T08:42:55.000Z

BTW, it might be good to configure nameparser http://nameparser.readthedocs.io/en/latest/customize.html#parser-customization-examples to remove stuff that is not needed for us.