emk/subtitles-rs

Split audio according to subtitle timing exactly for songs

Arthaey opened this issue · 5 comments

The default algorithm for splitting up audio works well enough for TV shows, where splitting up a sentence isn't really the end of the world.

For my weaker languages, I had the idea to use substudy with songs instead of TV. But it's going to be really weird and annoying to split up lyrics that way, especially when they're already "bite-sized".

So here I am, asking for yet another option to be added to substudy. ;)

PS: I don't know if you care about github issue labels, but "bug" vs "feature request" might be a thing to think about if you do. :)

emk commented

Hmm, yeah, I've gone back and forth for the right audio clip size for songs. Certainly, the ±1.5 seconds thing is annoying the first time you listen through the song, because you hear a lot of repetition.

But on later reviews, the audio will get shuffled out of order and you'll normally hear it just one or two lines of lyrics per day. And once you reach that point, using exact timings means that each lyric will be completely deconnected from context. I've done a lot of songs this way, and at best, it's... meh.

So I would encourage you to at least try lyrics with the same padding used for TV episodes, and wait until the cards at at least 30 days old, shuffled and separated, and tell me if you still feel like the padding is the wrong decision.

But "try it for a while before changing it" is so boring and reasonable! ;)

By which I mean, yeah, fair point. Do you have any suggestions for turning unformatted lyrics (or podcast transcripts) into timed subtitles?

emk commented

Sadly, no, I did all mine the really hard way back in the mists of time, with custom shell scripts. It was one of the earliest prototypes.

I do strongly recommend finding a Mac equivalent of Subtitle Edit. There are some really great subtitle-editing tools out there, and substudy does not plan to replace them. :-)

emk commented

This has been open for a while, and there's no obvious consensus that I need to fix something here, so I'm going to go ahead and close this. :-)