readium/architecture

Parsing a subtitle

HadrienGardeur opened this issue · 5 comments

It seems that while title and sortAs are covered in the new parsing document, that's not the case for subtitle: https://github.com/readium/architecture/blob/master/streamer/parser/metadata.md

cc @JayPanoz

Ah yes, good catch. I was unsure about this one as there is no mapping in appendix B for EPUB 3.1 and a bunch of other keys e.g. letterer so I may have dropped some of those because of that in the initial document.

Maybe before I PR that, it could be a good idea to define which missing items should be handled in this doc?

Currently:

  • subtitle
  • inker
  • penciler
  • letterer
  • imprint
  • numberOfPages
  • duration

The appendix is probably not entirely up-to-date and it was based on EPUB 3.1, whereas we should primarily focus on 3.2 now.

For the contributors (inker, penciler and letterer), I think the main issue was that I couldn't identify the proper MARC relator codes for them.

For numberOfPages, I think that in EPUB 3.2 at least, the recommendation is to use schema.org as well.

duration is handled in EPUB Media Overlay for which we also need to handle a specific case for narrator.
active-class and playback-active-class are not currently handled in RWPM, I'll let @danielweck chime in on that.

As for imprint, we can probably ignore it for now in the context of EPUB.

Thanks, let me take a look for everything but media overlays. I can probably PR that shortly.

So yeah inker, penciler and letterer are missing in relator codes, as confirmed in this w3c mail, and looks like some people use illustrator for that.

From the last link… I guess we really don’t want to go down this rabbit hole for letterer:

I've been using "illustrator" for some of these (penciller, inker, cover artist) but I use
"calligrapher" for letterers
and "colorist" for colorists.

We probably don't want to go down that rabbit hole for any of them if there's no exact equivalent. They'll end up in contributor, which is fine.