Voice type anywhere in Windows using OpenAI's Whisper speech recognition engine! This project lets you dictate anywhere in Windows using OpenAI's Whisper speech-to-text engine.
(This is a fork of mxro/autohotkey-chatgpt-voice, modified to allow plain voice typing instead of carrying out commands.)
I have to update this section to match the current fork...
- Download and install AutoHotKey V1 from autohotkey.com
- TBD
- Copy config.template.json to your own config.json, and edit it to provide your OpenAI API key for the property OpenapiKey.
{
"OpenapiKey": "",
"AutoHotKeyExec": ".\\bin\\autohotkey-1.1.37.01\\AutoHotkeyU64.exe"
}
- Double click on watch.ahk from the extracted files
- Press F8
- Say whatever you want to type
- Press F8
- Wait for OpenAI and AutoHotKey to do their magic
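As a rough illustration of the config.json step above, here is a minimal Go sketch that parses the two properties shown in the template and rejects an empty key. The `Config` type and the `parseConfig` function are hypothetical names chosen for this example; the project's actual loading code may look different.

```go
package main

import (
	"encoding/json"
	"errors"
	"fmt"
)

// Config mirrors the two properties shown in config.template.json.
// The type name is an assumption made for this sketch.
type Config struct {
	OpenapiKey     string `json:"OpenapiKey"`
	AutoHotKeyExec string `json:"AutoHotKeyExec"`
}

// parseConfig (hypothetical helper) decodes the JSON and checks
// that an API key was actually filled in.
func parseConfig(data []byte) (Config, error) {
	var cfg Config
	if err := json.Unmarshal(data, &cfg); err != nil {
		return Config{}, fmt.Errorf("parse config: %w", err)
	}
	if cfg.OpenapiKey == "" {
		return Config{}, errors.New("OpenapiKey is empty; add your OpenAI API key")
	}
	return cfg, nil
}

func main() {
	// Inline sample so the sketch runs without a config.json on disk.
	// The key value is a placeholder, not a real key.
	sample := []byte(`{
		"OpenapiKey": "sk-example",
		"AutoHotKeyExec": ".\\bin\\autohotkey-1.1.37.01\\AutoHotkeyU64.exe"
	}`)

	cfg, err := parseConfig(sample)
	if err != nil {
		fmt.Println("config error:", err)
		return
	}
	fmt.Println("AutoHotKey executable:", cfg.AutoHotKeyExec)
}
```

Failing early on an empty OpenapiKey means a forgotten key shows up as a clear message at startup rather than as a rejected API call after the first recording.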
The hotkey to start/stop a voice command is defined in watch.ahk. You can replace the following with a hotkey of your choice:
F8::
- Edit transcriptionPrompt.txt to customize the transcription.
- Add multiple transcription prompts and .ahk files. For example, you can have multiple trigger hotkeys for different languages or contexts.
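To show how a transcription prompt can influence the result, the following Go sketch builds a request for OpenAI's public audio transcription endpoint with the prompt text attached as the `prompt` form field. This is an illustration under assumptions, not necessarily how this fork sends audio: the function name `buildTranscriptionRequest`, the file name `recording.wav`, and the sample prompt are all hypothetical, and in practice the prompt would be read from a file such as transcriptionPrompt.txt.

```go
package main

import (
	"bytes"
	"fmt"
	"mime/multipart"
	"net/http"
)

// buildTranscriptionRequest (hypothetical helper) packs the audio, the model
// name, and an optional prompt into a multipart/form-data POST request.
func buildTranscriptionRequest(apiKey, prompt string, audio []byte) (*http.Request, error) {
	var body bytes.Buffer
	w := multipart.NewWriter(&body)

	// "recording.wav" is a placeholder file name for this sketch.
	part, err := w.CreateFormFile("file", "recording.wav")
	if err != nil {
		return nil, err
	}
	if _, err := part.Write(audio); err != nil {
		return nil, err
	}
	if err := w.WriteField("model", "whisper-1"); err != nil {
		return nil, err
	}
	// The prompt is optional; it can hint at language, spelling, or style.
	if prompt != "" {
		if err := w.WriteField("prompt", prompt); err != nil {
			return nil, err
		}
	}
	// Close writes the final multipart boundary.
	if err := w.Close(); err != nil {
		return nil, err
	}

	req, err := http.NewRequest(http.MethodPost,
		"https://api.openai.com/v1/audio/transcriptions", &body)
	if err != nil {
		return nil, err
	}
	req.Header.Set("Authorization", "Bearer "+apiKey)
	req.Header.Set("Content-Type", w.FormDataContentType())
	return req, nil
}

func main() {
	// Placeholder key, prompt, and audio bytes; nothing is sent over the network.
	req, err := buildTranscriptionRequest("sk-example",
		"Dictation in English with full punctuation.", []byte("fake audio"))
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(req.Method, req.URL.String())
}
```

With one prompt file per language or context, each trigger hotkey could pass a different prompt string to a function like this one.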
- ChatGPT-AutoHotkey-Utility: Uses AutoHotKey to perform a number of actions, such as translation
- ChatGPT Voice Assistant: Provides a Windows based assistant driven by ChatGPT
- How to Make Your Own Windows Transcription App With Whisper and AutoHotkey: Step by step tutorial to make a transcription app using AutoHotKey (added as per reddit)
task build
task package
go run ./cmd/whisper-autohotkey/.