Basic text-to-speech reader in Python

Primary language: Python. License: MIT.

TTS 3000: a highlighted-text-to-speech reader

TTS 3000 reads highlighted text aloud on command. Behind the scenes, the program captures the highlighted text, converts it to an audio file, and then reads the audio file aloud. The user can control playback and adjust settings such as the voice, speed, volume, and rewind/fast-forward duration.
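The capture, convert, and play steps described above can be sketched roughly as follows. This is a minimal illustration, not the code in main.py: the function name read_aloud, its parameters, and the default file name are hypothetical, and the three stages are passed in as plain callables so the sketch runs without any clipboard or speech library installed.

```python
from typing import Callable


def read_aloud(
    capture_text: Callable[[], str],
    synthesize: Callable[[str, str], None],
    play: Callable[[str], None],
    audio_path: str = "tts_temp.wav",  # hypothetical file name
) -> bool:
    """Run the three stages in order: capture, convert, play.

    Returns False when no text was captured, True otherwise.
    """
    text = capture_text().strip()
    if not text:
        return False  # nothing highlighted, so there is nothing to read

    synthesize(text, audio_path)  # stage 2: text -> audio file
    play(audio_path)              # stage 3: audio file -> speakers
    return True


if __name__ == "__main__":
    # Stand-in stages that only print what a real backend would do.
    read_aloud(
        capture_text=lambda: "Hello from TTS 3000",
        synthesize=lambda text, path: print(f"writing {len(text)} characters to {path}"),
        play=lambda path: print(f"playing {path}"),
    )
```

In a real setup the stand-ins would be replaced by, for instance, a clipboard read and a speech engine that can save to a file. Which libraries TTS 3000 uses for these stages is not stated in this README.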

Controls

  • CTRL + b == play/pause highlighted text
  • ALT + v == rewind
  • ALT + n == fast-forward
  • ALT + o == extract text using OCR
  • Minimise the prompt/console/terminal window to use TTS 3000
  • Close the prompt/console/terminal window to quit TTS 3000
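As an illustration of how the default hotkeys above might map to playback actions, here is a small sketch. The action names, the Playback class, and the 5-second skip default are assumptions made for this example and are not taken from main.py. Rewind and fast-forward are clamped so the position never leaves the audio.

```python
from dataclasses import dataclass

# Default hotkeys from the Controls list, mapped to hypothetical action names.
DEFAULT_HOTKEYS = {
    "ctrl+b": "play_pause",
    "alt+v": "rewind",
    "alt+n": "fast_forward",
    "alt+o": "ocr",
}


@dataclass
class Playback:
    duration: float             # length of the audio in seconds
    position: float = 0.0       # current position in seconds
    playing: bool = False
    skip_seconds: float = 5.0   # assumed rewind/fast-forward step

    def handle(self, hotkey: str) -> str:
        """Apply the action bound to a hotkey and return its name."""
        # Normalise "CTRL + b" to "ctrl+b" before the lookup.
        action = DEFAULT_HOTKEYS.get(hotkey.lower().replace(" ", ""))
        if action == "play_pause":
            self.playing = not self.playing
        elif action == "rewind":
            # Never move before the start of the audio.
            self.position = max(0.0, self.position - self.skip_seconds)
        elif action == "fast_forward":
            # Never move past the end of the audio.
            self.position = min(self.duration, self.position + self.skip_seconds)
        return action or "ignored"


if __name__ == "__main__":
    player = Playback(duration=12.0)
    for key in ("CTRL + b", "ALT + n", "ALT + n", "ALT + v"):
        print(key, "->", player.handle(key), "| position:", player.position)
```

Keeping the bindings in one dictionary means changing a hotkey only requires editing a single entry.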

User guide

  1. Download or clone this GitHub repository, and unzip it if it downloads as a .zip file
  2. Run the program (main.py), for example by double-clicking launch_tts.bat or from Anaconda Prompt
  3. Minimise the console once you see "TTS 3000 in now running"
  4. Highlight some text, such as in a browser, Word document, or PDF, and press CTRL + b (default, can be changed)
  • Optional: customise playback settings and hotkeys using the USER SETTINGS section near the top of the main.py file, which can be opened in, for example, Notepad.
  • Note: TTS 3000 creates temporary audio files, so be aware of who has access to your TTS 3000 folder. Anyone with access to that folder can see the temporary audio files while they exist. Avoid, for example, placing TTS 3000 in a folder on a shared drive and then using it to read private bank statements.
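
For readers concerned about the temporary audio files, the sketch below shows one way such a file could be kept short-lived. It is an alternative under stated assumptions, not how TTS 3000 behaves: the helper name temporary_audio_file is hypothetical, and the file is placed in the system temporary directory rather than the TTS 3000 folder. On POSIX systems, tempfile.mkstemp creates the file readable and writable only by the current user; permissions work differently on Windows.

```python
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def temporary_audio_file(suffix: str = ".wav"):
    """Yield the path of a fresh temporary file and delete it afterwards."""
    fd, path = tempfile.mkstemp(prefix="tts_", suffix=suffix)
    os.close(fd)  # the caller reopens the file by path
    try:
        yield path
    finally:
        # Remove the file even if synthesis or playback raised an error.
        if os.path.exists(path):
            os.remove(path)


if __name__ == "__main__":
    with temporary_audio_file() as audio_path:
        with open(audio_path, "wb") as handle:
            handle.write(b"placeholder audio bytes")
        print("exists during use:", os.path.exists(audio_path))
    print("exists afterwards:", os.path.exists(audio_path))
```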

Debugging

  • CTRL + # == report status to console (for debugging)
  • CTRL + 1 + 2 == stop/exit/escape

Created March 2023

Author: hawkem / mh0w