A Python script to add/remove furigana for Japanese epub books. Using Mecab and Unidic.
ある日の放課後だった。
ある日の放課後だった。
Should work with Python3.6 or higher, but only tested with Python 3.7.5
pip install furigana4epub
If you want to use the full version of UniDic, read this article.
To add furigana:
furigana4epub target.epub
To remove furigana:
furigana4epub -d target.epub
furigana4epub -h
usage: furigana4epub [-h] [-e EXTENSION] [-r] [-s SUFFIX] [-d] [-b] [-p]
paths [paths ...]
A Python script to add/remove furigana for Japanese epub books. Using Mecab
and Unidic.
positional arguments:
paths Paths of Japanese epub books,can be file names or file
folders
optional arguments:
-h, --help show this help message and exit
-e EXTENSION, --extension EXTENSION
File extension to filter by(default:.epub)
-r, --recursive Search through subfolders
-s SUFFIX, --suffix SUFFIX
suffix of the converted file(default:"_furigana" for
adding or "_no_furigana" for removing furiganas)
-d, --remove remove furigana from epub file
-b, --blod Covert <ruby> dot to html <b> tag before adding
furigana
-p, --rp Do not add ruby <rp> tag to provide fall-back
parentheses for browsers that do not support display
of ruby annotations.Result a smaller output but with
less compatibility.
If you are using Calibre for ebook conversion, choose azw3(KF8) output format.
Calibre's mobi output format wouldn't support <ruby>
tag.
This script is inspired by WebNovelCrawler, pinyin2epub ,have some codes from them.