psu-libraries/psulib_traject

Identify Base SUDOC call numbers

Part of #285

In SUDOC call numbers, we can expect letters and numbers separated by one of the characters : - . / and possibly by spaces. However, there may be no space before the parts information, e.g.

  • D 101.47:v.14
  • C 3.2:H 62/8/v.2

In both cases, though, the parts information comes after the last separator of some kind.

We should expect the most common words for volume, issue, and number (v., iss., vol., num., etc.), and we only expect these in English.

Also, if we are removing things like "KIT" from call numbers, we will want to carry that behavior over here, e.g.

  • HE 20.427:C 76/2/KIT.
  • HE 20.427:H 33
  • HE 20.427:H 75/PACK.
  • HE 20.427:IL 6/KIT
  • HE 20.427:IN 7
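
The rules above can be sketched as a pair of end-anchored regular expressions. This is only an illustration in Python, not the project's implementation: the function name `base_sudoc`, the exact word lists, and the decision to require a period after the volume abbreviation (so that a cutter such as "V 67" is not mistaken for a volume) are all assumptions.

```python
import re

# Separators named in this issue (: - . /) plus whitespace.
_SEP = r"[:\-./\s]+"

# Trailing parts information such as "v.14" or "vol. 2".
# Assumption: the abbreviation is always followed by a period, which keeps
# cutters like "V 67" from being treated as a volume. "no" is an assumed
# addition to the v/vol/iss/num words mentioned in the issue.
_PARTS_RE = re.compile(
    _SEP + r"(?:vol|v|iss|num|no)\.\s*\d+\.?\s*$", re.IGNORECASE
)

# Trailing format words such as "KIT" or "PACK", with an optional final period.
_FORMAT_RE = re.compile(_SEP + r"(?:KIT|PACK)\.?\s*$", re.IGNORECASE)


def base_sudoc(call_number: str) -> str:
    """Return a hypothetical base call number with trailing parts/format removed."""
    base = call_number.strip()
    while True:
        # Strip one trailing parts segment and one trailing format word,
        # then repeat until nothing more can be removed.
        stripped = _FORMAT_RE.sub("", _PARTS_RE.sub("", base))
        if stripped == base:
            return base
        base = stripped


if __name__ == "__main__":
    for cn in ["D 101.47:v.14", "C 3.2:H 62/8/v.2", "HE 20.427:C 76/2/KIT."]:
        print(cn, "->", base_sudoc(cn))
```

With these patterns, "D 101.47:v.14" reduces to "D 101.47", "C 3.2:H 62/8/v.2" to "C 3.2:H 62/8", and "HE 20.427:C 76/2/KIT." to "HE 20.427:C 76/2", while "HE 20.427:H 33" and "HE 20.427:IN 7" pass through unchanged.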

WDLL

  • Generate base call numbers for SUDOC call numbers