This project can be used in three ways, depending on your needs and environment:
- Google Colab: Ideal if you prefer a cloud-based environment. Colab runs the code without any local setup: upload the notebook containing the project code and execute it there. Colab also provides access to GPU resources, which can accelerate computations.
- Docker: If you prefer containerized deployment, you can use Docker to build and run the project. Encapsulating the project and its dependencies in a Docker container gives you a consistent, reproducible environment across machines, which is particularly useful for managing dependencies and avoiding conflicts.
- Direct Installation on Your Computer: To run the project directly on your local machine, clone the project repository, install the dependencies listed in the requirements.txt file, and run the code in your preferred development environment.
Installation (currently works only with Nvidia GPUs; Mac support is planned, using https://github.com/ggerganov/whisper.cpp)
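Because the installation currently requires an Nvidia GPU, it can save time to confirm one is visible before installing anything. The following is a minimal sketch, not part of the project: the helper name `nvidia_gpu_visible` is hypothetical, and it assumes the Nvidia driver's `nvidia-smi` tool is installed and on your PATH.

```python
import shutil
import subprocess


def nvidia_gpu_visible():
    """Return True if nvidia-smi is on PATH and lists at least one GPU."""
    exe = shutil.which("nvidia-smi")
    if exe is None:
        # Driver tooling is not installed or not on PATH.
        return False
    try:
        # "nvidia-smi -L" prints one line per detected GPU.
        result = subprocess.run(
            [exe, "-L"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and "GPU" in result.stdout
```

This only checks that the driver can see a GPU. It does not verify that the CUDA libraries needed by the packages in requirements.txt are set up correctly.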
- Installing Required Libraries: The project's Python dependencies are listed in the requirements.txt file. Install them with the following command:
pip install -r requirements.txt
- Starting the API: In a terminal, navigate to the project's main directory and run the following command:
python app.py
This starts the API at http://127.0.0.1:5000/.
- Sending a Transcription Request: To transcribe one or more audio files, use the following curl command:
curl -X POST -F "files=@file_path1.mp3" -F "files=@file_path2.mp3" http://localhost:5000/transcribe
This command sends the listed audio files (here file_path1.mp3 and file_path2.mp3) to the API and returns the transcription result as JSON.
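If you would rather call the API from Python than from curl, the same request can be sketched with the standard library alone. This is an illustrative client, not code shipped with the project: the helper names `build_multipart` and `transcribe` are hypothetical, and the only details taken from the curl command above are the `/transcribe` URL and the repeated `files` form field. The structure of the returned JSON is not documented here, so the sketch simply returns whatever the server sends.

```python
import json
import uuid
from pathlib import Path
from urllib import request

# Endpoint taken from the curl example; adjust if the API runs elsewhere.
API_URL = "http://localhost:5000/transcribe"


def build_multipart(files, field_name="files"):
    """Encode (filename, bytes) pairs as a multipart/form-data body.

    Every file is sent under the same field name, mirroring the repeated
    -F "files=@..." flags in the curl command. Returns (body, content_type).
    """
    boundary = uuid.uuid4().hex
    parts = []
    for filename, data in files:
        header = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{field_name}"; '
            f'filename="{filename}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        )
        parts.append(header.encode("utf-8") + data + b"\r\n")
    # Closing boundary marks the end of the form data.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(paths, url=API_URL, timeout=600):
    """POST the given audio files to the API and return the parsed JSON reply."""
    files = [(Path(p).name, Path(p).read_bytes()) for p in paths]
    body, content_type = build_multipart(files)
    req = request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the API running, `transcribe(["file_path1.mp3", "file_path2.mp3"])` sends both files in a single request, as the curl command does. The generous timeout is a guess, on the assumption that transcribing long recordings may take a while.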
MIT License
Copyright (c) 2023 EmreOzdemiroglu
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.