ATOM is a personal assistant for managing and reviewing my schedule.
It uses APIs and packages including OpenAI, ElevenLabs, Google Calendar, and Twilio to provide features such as creating and deleting calendar events and searching the internet for answers to specific questions.
The code uses a microphone to receive audio inputs from users, transcribes them into text, and then generates audio responses for the user.
Demo on my Twitter.
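The listen, transcribe, and respond flow described above can be pictured as a single conversational turn. The sketch below is only an illustration and is not taken from atom.py: `run_turn` and its three callables (`listen`, `respond`, `speak`) are hypothetical names standing in for the speech-to-text, language-model, and text-to-speech steps, so the example runs without a microphone or API keys.

```python
from typing import Callable, Optional


def run_turn(
    listen: Callable[[], Optional[str]],
    respond: Callable[[str], str],
    speak: Callable[[str], None],
) -> Optional[str]:
    """Run one listen -> respond -> speak turn and return the reply, if any."""
    text = listen()        # speech-to-text step (hypothetical callable)
    if not text:           # nothing was recognized, so skip this turn
        return None
    reply = respond(text)  # language-model step (hypothetical callable)
    speak(reply)           # text-to-speech step (hypothetical callable)
    return reply


if __name__ == "__main__":
    # Stand-ins so the sketch runs without audio hardware or credentials.
    run_turn(
        listen=lambda: "what is on my calendar today",
        respond=lambda text: f"You asked: {text}",
        speak=print,
    )
```

Passing the three steps in as callables keeps the turn logic independent of any particular speech or language service, which also makes it easy to exercise with plain strings.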
Python 3.x
openai
elevenlabslib
pydub
simpleaudio
speech_recognition
search
twilio
langchain
google-api-python-client
google_docs
twilio_utils
screen_capture
You will need credentials for google_drive, google_docs, and service_account.
Install the required dependencies:
pip install -r requirements
Set the API keys and authentication tokens in config.py and google_cal.
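One possible way to keep secrets out of the source tree is to have the configuration read them from environment variables. The sketch below is an assumption about how that could look, not the project's actual config.py: the function name `load_settings` and every variable name in it are hypothetical.

```python
import os


def load_settings(env=os.environ):
    """Collect API credentials from environment variables.

    The variable names below are illustrative, not taken from the project.
    """
    names = [
        "OPENAI_API_KEY",
        "ELEVENLABS_API_KEY",
        "TWILIO_ACCOUNT_SID",
        "TWILIO_AUTH_TOKEN",
    ]
    # Fail early with a clear message instead of at the first API call.
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing settings: " + ", ".join(missing))
    return {name: env[name] for name in names}
```

Calling `load_settings()` once at startup surfaces any missing key before the assistant starts listening.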
Run the code by executing the command python atom.py in your terminal or IDE.
- Create, delete, and view Google Calendar events using the Google API.
- Search the internet for answers to specific questions using the Google API.
- Text-to-speech synthesis and speech-to-text recognition using the ElevenLabs API and the speech_recognition package.
- Send SMS messages to recipients using the Twilio API.
- Natural language processing using the OpenAI API.
- Image analysis using Microsoft Azure's Computer Vision.
- Create drafts with Google Docs.
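As an illustration of the calendar feature, the sketch below builds an event body in the shape the Google Calendar API accepts for event creation. The helper `build_event`, its parameters, and the sample values are hypothetical and not taken from this project; the commented call assumes an already authorized google-api-python-client service object.

```python
from datetime import datetime, timedelta


def build_event(summary, start, minutes=30, time_zone="UTC"):
    """Return an event body with a summary, start time, and end time."""
    end = start + timedelta(minutes=minutes)
    return {
        "summary": summary,
        "start": {"dateTime": start.isoformat(), "timeZone": time_zone},
        "end": {"dateTime": end.isoformat(), "timeZone": time_zone},
    }


event = build_event("Team sync", datetime(2024, 1, 15, 9, 0), minutes=45)
# With an authorized service object (assumed here), the event could be created with:
#   service.events().insert(calendarId="primary", body=event).execute()
```

Building the body separately from the API call makes the event data easy to inspect before anything is written to a real calendar.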
This script is open-source and licensed under the MIT License. For more details, check the LICENSE file.