madmaxoft/SkauTan

Clean up song metadata

madmaxoft opened this issue · 2 comments

Some songs have metadata remnants of bad splitting, such as " - " at the front of their Author field, etc. Similar metadata artifacts are created with the filename-to-metadata matcher. All these artifacts should be cleaned up while scanning the song metadata.

  • Spaces
  • Dashes
  • Empty parentheses, brackets and braces
  • (numbers?)

Additionally, some items should be scrubbed from anywhere in the string

  • "Various Artists", "VA", "Various"
  • "Unknown Artist" et al.
  • "Neznámý interpret"
  • "Track", "Stopa"
  • "Copy of", "Kopie"

Also noteworthy: The current matcher is also a bit too greedy when the song name contains a genre name as well, such as The Godfather Waltz, removing the genre name. This needs some tweaking, too.

It might be possible to detect situations when both the author and title are stuffed into a single field, and the other field is empty / has had other info that was removed during scanning. For example:
ID3Author = SW 29, ID3Title = author - title