Speech GPT

A simple command-line tool that allows you to chat with your speeches.

Features

Speech to text using OpenAI Whisper
VectorDB for storing speeches

Prerequisites

Python 3.10
FFMPEG v4+

Installation

Clone this repository:

git clone https://github.com/conceptcodes/speech-gpt-python.git
cd speech-gpt-python

Create a virtual environment:

 conda create -n speech-gpt python=3.10
 conda activate speech-gpt

Install dependencies:
```
pip install -r requirements.txt
```
Grab an API key from OpenAI
export the API key to your environment
```
export OPENAI_API_KEY=<your-api-key>
```

Usage

To run the CLI, use the following command:

python main.py -f /path/to/file


     _______..______    _______  _______   ______  __    __  
    /       ||   _  \  |   ____||   ____| /      ||  |  |  | 
   |   (----`|  |_)  | |  |__   |  |__   |  ,----'|  |__|  | 
    \   \    |   ___/  |   __|  |   __|  |  |     |   __   | 
.----)   |   |  |      |  |____ |  |____ |  `----.|  |  |  | 
|_______/    | _|      |_______||_______| \______||__|  |__| 
                                                             
  _______ .______   .___________.
 /  _____||   _  \  |           |
|  |  __  |  |_)  | `---|  |----`
|  | |_ | |   ___/      |  |     
|  |__| | |  |          |  |     
 \______| | _|          |__|     
                                 

Model loaded!
/opt/anaconda3/envs/speech-gpt/lib/python3.10/site-packages/whisper/transcribe.py:126: UserWarning: FP16 is not supported on CPU; using FP32 instead
  warnings.warn("FP16 is not supported on CPU; using FP32 instead")
Transcription done!

Creating vector store...
Saving vector store...

>>> Whats the theme of this talk?

Speech GPT: 
The theme of this conversation appears to be functional programming and the use of types in designing functions.

>>>

Roadmap

Store embeddings in local postgres DB
Add a cache for files
clean up the CLI
Add timestamp sources to the response

ConceptCodes/speech-gpt-python

Speech GPT

Features

Prerequisites

Installation

Usage

Roadmap