kronusaturn/lw2-viewer

Add export to epub

Opened this issue · 9 comments

There are some efforts that scrape the site into an ebook (e.g., https://github.com/AABoyles/LessWrong-Portable, or general webpage to ebook convertors), but they don’t export the comments. (Unless we save the whole site as html and convert it to epub, which will have a lot of unwanted cruft.) They are also rather brittle.

It’d be so much better if the site natively supported exporting to epub.

Another idea is to have a stripped down, minimal html view (something like the print version of Wikipedia). I find this the better option, since it is suitable for printing and can be easily converted to epub by just saving the whole html.

Is there something wrong with the existing print view?

Our HTML is already pretty minimal -- I tried saving a post and converting it to EPUB using Calibre and it seems to work pretty well. There is a bit of cruft at the beginning but it's only a few lines.

I can’t find the existing print view😅

I am simply referring to the fact that using the Print feature of your browser uses the CSS specified for the print media type, i.e., it uses a special layout/styling/etc. for printing. (This is quite common.)

Once you’ve got that Print dialog open, you can save the file as a PDF, with the print layout, using your operating system’s print-to-PDF feature. (At least, the Mac OS has this, and, I think, Windows also?)

And a PDF can easily be converted into an epub…

This is on the medium-term roadmap for LessWrong. The main issue is that the print view of a single post isn't what you want, you generally want a sequence, or a list of posts. So this will probably follow after a reading-list feature.

@jimrandomh There are generalized tools that can do the sequence of pages part; E.g., https://epub.press/. What we need is just a nice, clean view of the content to feed into these.

I don’t remember what my issue was at the time, but I have been converting GW pages to EPUB using pandoc/calibre (and Mozilla Readability, though it’s not essential) without a hitch for many months. So I am closing the issue, thanks.

This still seems like it would be nice to have, along with an OPDS catalog to make it easy to browse directly from an ereader.