/bedrock-multimodal-agent

Multimodal agent based on Bedrock, ASR (Transcribe), and TTS (Polly)

Primary language: Python. License: MIT No Attribution (MIT-0).

Multimodal Agent with LLM (Bedrock), ASR (Transcribe), TTS (Polly)

Forked from Amazon Bedrock Voice Conversation, with bug fixes and updates for new Bedrock, Transcribe, and Polly features.

How to run the app:

    pip install -r ./requirements.txt
    export AWS_ACCESS_KEY_ID=<...>
    export AWS_SECRET_ACCESS_KEY=<...>
    export AWS_DEFAULT_REGION=<...> # Optional, defaults to us-east-1
    export MODEL_ID=<...> # Optional, defaults to Claude 3 Haiku
    python ./app.py
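
The environment variables above could be resolved along these lines. This is only a sketch of the documented defaults, not the actual code in app.py: the `load_settings` helper and the constant names are hypothetical, and the default model ID is assumed to be the Bedrock identifier for Claude 3 Haiku.

```python
import os

# Assumption: Bedrock model ID for Claude 3 Haiku; app.py may use a different value.
DEFAULT_MODEL_ID = "anthropic.claude-3-haiku-20240307-v1:0"
DEFAULT_REGION = "us-east-1"


def load_settings(env=None):
    """Hypothetical helper: resolve region and model ID, applying the documented defaults."""
    env = os.environ if env is None else env

    # The two credential variables have no default, so fail early if they are unset.
    required = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))

    # Optional variables fall back to their defaults when unset or empty.
    return {
        "region": env.get("AWS_DEFAULT_REGION") or DEFAULT_REGION,
        "model_id": env.get("MODEL_ID") or DEFAULT_MODEL_ID,
    }
```

Calling `load_settings()` with no arguments reads from the current process environment; passing a dict makes the defaults easy to check without touching real credentials.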