Markismus/PocketBookDic

Need Help in Converting .mobi to .xdxf/.dic messed up with numbering & line break

retiarylime opened this issue · 0 comments

I want to convert this kindle mobi Merriam-Webster's Advanced Learner's English Dictionary to xdxf and dic format for pocketbook. I have first converted the mobi into html using kindleunpack. Then I tried using your script but the reconstruction thing messed up with the superscript, numbering and line break of the definition.

This is the HTML view (Merriam-Webster's divide multiple definitions with superscript numbering; note that superscript number 1 above the word back):
Screenshot 2024-02-29 100317
Screenshot 2024-02-29 100327

Here I give you some screenshots from the pocketbook for the definition of the word 'back' with messed up numberings & repeated numbering style which is confusing. The line break is also confusing. Note the superscript was converted to normal size & there is additional small letter roman numbering:
scr0001
scr0002
scr0003
scr0004

Dictionary file:
html.zip
mobi.zip