Text to Speech with AWS Polly and Spring Boot

This project uses the AWS Polly service to convert text into audio. It is a Spring Boot application that exposes two HTTP API endpoints: one generates an audio file from text, and the other plays the audio.

Setup ⚙️

Before running the project, ensure you have the following environment variables set in your development environment:

AWS_ACCESS_KEY: Your AWS access key for authentication.
AWS_SECRET_KEY: Your AWS secret key for authentication.
AWS_REGION: The AWS Region in which to call the Polly service (for example, us-east-1).
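
Because a missing variable usually only shows up as an authentication error at request time, it can help to check the environment before starting the application. The following is a minimal sketch in plain Java, not code from this project: the class name EnvCheck and the method missing are hypothetical, and only the three variable names come from the list above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not part of the project): reports which required variables are unset.
class EnvCheck {
    // Variable names taken from the Setup section.
    static final List<String> REQUIRED = List.of("AWS_ACCESS_KEY", "AWS_SECRET_KEY", "AWS_REGION");

    // Returns the names from REQUIRED that are absent or blank in the given environment map.
    static List<String> missing(Map<String, String> env) {
        List<String> result = new ArrayList<>();
        for (String name : REQUIRED) {
            String value = env.get(name);
            if (value == null || value.isBlank()) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> unset = missing(System.getenv());
        if (unset.isEmpty()) {
            System.out.println("All required variables are set.");
        } else {
            System.out.println("Missing variables: " + String.join(", ", unset));
        }
    }
}
```

Passing the environment in as a map, rather than reading System.getenv() inside the method, keeps the check easy to exercise with test data.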

How to Use the Project 📚

Clone the project repository from GitHub: Repository Link.
Set the environment variables listed above on your system.
Open a terminal and navigate to the project directory.
Run the project using the ./mvnw spring-boot:run command (mvnw.cmd spring-boot:run on Windows).

Generate Text-to-Speech Audio 🎤

To generate audio from text, use the following curl request:

curl --location 'http://localhost:8080/api/v1/speech' \
--header 'Content-Type: application/json' \
--data '{
    "text": "Sample text"
}'

Replace "Sample text" with the text you want to convert into audio. The generated audio file will be named "speech.mp3".
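
The same request can be built from Java with the JDK's built-in java.net.http API (Java 11 or later). This is a sketch under stated assumptions: the class SpeechRequestSketch and its methods build and jsonBody are hypothetical names, the JSON escaping covers only the most common characters, and only the URL, header, and "text" field come from the curl command above. Because curl sends --data as a POST, the sketch uses POST as well.

```java
import java.net.URI;
import java.net.http.HttpRequest;

// Hypothetical client-side helper (not part of the project).
class SpeechRequestSketch {
    // Minimal JSON string escaping for characters most likely to appear in plain text.
    // A real client would typically use a JSON library instead.
    static String jsonBody(String text) {
        String escaped = text
                .replace("\\", "\\\\")
                .replace("\"", "\\\"")
                .replace("\n", "\\n")
                .replace("\r", "\\r")
                .replace("\t", "\\t");
        return "{\"text\": \"" + escaped + "\"}";
    }

    // Builds (but does not send) a POST request equivalent to the curl command.
    static HttpRequest build(String baseUrl, String path, String text) {
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + path))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(jsonBody(text)))
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request = build("http://localhost:8080", "/api/v1/speech", "Sample text");
        System.out.println(request.method() + " " + request.uri());
        System.out.println(jsonBody("Sample text"));
    }
}
```

To actually send the request with the application running, pass it to HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.discarding()). The response format is not described in this README, so the sketch does not try to interpret it.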

Play Text-to-Speech Audio 🔊

To play the generated audio, use the following curl request:

curl --location 'http://localhost:8080/api/v1/speech/play' \
--header 'Content-Type: application/json' \
--data '{
    "text": "Sample text"
}'

Replace "Sample text" with the text you want to play as audio. The audio will be streamed and played.

Project Workflow 📝