๐ [Want to contribute?] ebook2audiobookxtts roadmap
Opened this issue ยท 7 comments
All Features open to public Contributions โญ
- -h -help parameter info in different languages
- Notebooks Folder Talked about here
- Make Chinese text splitting not split words and improve pause timing Talked about here
- Get Kaggel Notebook working (Needs to be giving dropdown menus to interact with the headless version) Talked about here
- Get Working Google Colab Notebook Talked about here
- Make a ios app
- Make an android app
Wanted Extra Parameters
- Translate from translate the full ebook to X lang and audiobook it, output original ebookfile, audiobook file, and translated ebook file
- Make parameter for specifying output audio file format
- The F5-TTS model referenced here
- Make ebook input parameter accept a list of files for multiple files.
- Make a way for multiple lines to have audio generated at a time using multiple instances of coqui tts running for more beefy hardware.
- Make ebook input parameter accept a folder containing ebook files to auto-run through.
- OCR for PDF files (as a Parameter) Talked about here
- Add a force use device (cpu or GPU) (This will force set the device at the top of the script) Talked about here(currently being added by @ROBERT-MCDOWELL) ref here
- Use Deepfilternet2 to de-noise any reference audio for voice cloning, demo huggingfacespace using it, Talked about here
- Custom model dir input for pointing to a folder containing all of the custom model files if available instead of having to point to each model file individually
- Change voices per chapter parameter Talked about here
My Other Repos I Want to Integrate into the App for Extra Options :)
- Add app parameter that launches the ebook2audiobookpiper-tts GUI.(Piper tts appears to have issues working in ARM (apple silicon)MAC But runs fine in the docker on ARM)
- Add app parameter that launches the ebook2audiobookstyletts2 GUI.
- Add app parameter that launches the ebook2audiobook-espeak GUI.
- Add app parameter that launches the FineTune XTTS GUI.
- Add app parameter for using Barktts documentation from coqui tts
Create a standard function for load_model() and inference_model() for:
- โXTTSv2
- Styletts2
- ๐ช Piper-tts
- ๐ถ Bark tts
# Standard functions should be:
def load_model() - Will load model and download model to load if not available locally
def inference_model() - Will inference the pre-loaded model
Create Readme in these languages
- English (en)
- Spanish (es)
- French (fr)
- German (de)
- Italian (it)
- Portuguese (pt)
- Polish (pl)
- Turkish (tr)
- Russian (ru)
- Dutch (nl)
- Czech (cs)
- Arabic (ar)
- Chinese (zh-cn)
- Japanese (ja)
- Hungarian (hu)
- Korean (ko)
Binary builds Working pyinstaller script for:
- ๐ Mac Intel x86
- ๐ช Windows x86
- ๐ง Linux x86
- ๐ฅ๏ธ๐ Apple Silicon Mac
- ๐ช๐ช ARM Windows
- ๐ง๐ช ARM Linux
๐ Single pip command install that works for:
- being overseen by @ROBERT-MCDOWELL
- ๐ Mac Intel x86
- ๐ช Windows x86
- ๐ง Linux x86
- ๐ฅ๏ธ๐ Apple Silicon Mac
- ๐ช๐ช ARM Windows
- ๐ง๐ช ARM Linux
Extra Overkill for training models and such (All supported Coqio tts models and piper-tts in one easy command)
- For info about this @DrewThomasson, he is currently working on the development of this, work-in-progress-repo here
- Make a easy to use training gui for all coqio tts models in the ljspeech format training recipes here from coqui tts
For higher level developers:
- Integrate VoxNovel experimental functionality into this ๐คท eventually. . .
Wanted Auto-testing scripts for development
- Standard model headless run through every language sample Samples located here
@DrewThomasson if you want to help out at all! ๐
Another interesting option would be to change voices between chapters. for i.e.:
--voice_mapping {"chapters": {1:"john.wav",2:"stella.wav",3:"child.wav",4:"random"} }
so the selected chapters will see their voice mapped, others will keep the main --voice intact.
Another interesting option would be to change voices between chapters. for i.e.: --voice_mapping {"chapters": {1:"john.wav",2:"stella.wav",3:"child.wav",4:"random"} } so the selected chapters will see their voice mapped, others will keep the main --voice intact.
@ROBERT-MCDOWELL Added to roadmap checklist
Compare tts
Translate ebook to X language
#35 (comment)
Integration with https://github.com/janeczku/calibre-web
@geneliu how you would see it?
flask api convo #179 (comment)