This walkthrough depicts an AWS Cloud-based solution that supports organizations in providing near real-time translation support through Amazon Connect, Amazon Transcribe, and serverless code through AWS Lambda. Translations that would take hours to complete manually are performed in moments and read back, in life-like speech, and in a member’s chosen language through Amazon Polly.
Figure 1. Technical architecture of the multilingual contact center solution described in this post. ![Technical architecture of the multiLingual contact center quickstart solution. ]
- The constituent calls the main contact center number. The greeting prompts the caller to select a language. The caller is asked to speak in the language of their choice.
- The constituent’s voice is streamed near real time using Amazon Kinesis Data Streams.
- Amazon Connect contact flow triggers a Lambda function to convert voice to text.
- Amazon Transcribe is used to transcribe the constituent’s spoken words into text.
- Transcribed text is stored in Amazon Simple Storage Service (Amazon S3) and Amazon DynamoDB.
- Amazon Connect Contact Control Panel (CCP) gets the transcribed text printed in the window.
- The transcribed text is translated to English using Amazon Translate and displayed in the CCP.
- The agent reviews the text and types the response in the CCP.
- Amazon Translate is used to convert the English text written by the agent into the language chosen by the constituent. Amazon Polly is used to create a voice file for the translated text.
- The translated voice is played back to the constituent as a response to the question they asked.
The steps below will guide you through the process of deploying a multilingual contact center. Section 1 will set up the foundation infrastructure needed to support the solution while Section 2 will walkthrough the steps to configure the Amazon Connect instance. Section3 walks through testing the deployed solution.
- Have an AWS account
- Have an understanding of Amazon Connect, Lambda, and AWS Identity and Access Management (IAM)
- Have permissions to create and modify Lambda functions
- Configure an Amazon Connect instance for inbound and outbound calls, and claim a phone number after you create your instance. The Get started with Amazon Connect documentation (the first two steps) provides valuable background knowledge for this process.
- Turn on data streaming on the Amazon Connect instance.
- Turn on live media streaming on the Amazon Connect instance
Section 1: Deploy AWS CloudFormation templates
- Clone the repo
- Navigate to the repository folder on your local machine
- Create a .zip of the following files/folders:
- cloudformation/asset.7b1f3c122a53128c55239bc2e97800299bcce83e3fb7394b79ed0b35af5757ee/mlcc-transcribe-polly.py
- Create a new Amazon S3 bucket and upload the folders named CloudFormation and CCP from the cloned repo.
- Navigate to the CloudFormation dashboard within the AWS console and create a CloudFormation stack using the file named “MultiLingualCC.yaml.” The following parameters are needed to be provided for the stack to launch.
- Stack name – The stack name is an identifier that helps you find a particular stack from a list of stacks. A stack name can contain only alphanumeric characters (case-sensitive) and hyphens. It must start with an alphabetic character and can't be longer than 128 characters.
- Amazon Connect instance ARN. how to find your instance ARN
- Call audio bucket name – Enter the (globally unique) name you would like to use for the Amazon S3 bucket where you will store the audio files and the sample contact flow. This template will fail to deploy if the bucket name you choose is currently in use.
- Website bucket name – Enter the (globally unique) name you would like to use for the Amazon S3 bucket where you will store the website assets and the sample contact flow. This template will fail to deploy if the bucket name you choose is currently in use.
- Resources bucket – This is the bucket you created in step 1c.
- audioFilePrefix – The Amazon S3 prefix where the audio files will be saved (must end in "/").
- CloudFront Price Class – Specify the CloudFront price class. See pricing details for more information.
- Compute type – Specify whether to use AWS Fargate or Lambda for transcribing call audio to text. The solution, by default, uses a Lambda function to transcribe call audio consumed from Amazon Kinesis Video Streams. Lambda has a maximum run time of 15 minutes per invocation, whereas Fargate has no such limits and can run as long as the call is connected.
- rawAudioUploadPrefix – The Amazon S3 prefix where raw/wav (audio/L16; mono; 8 kHz) audio recordings may be uploaded if you want to process an audio file compared to making a phone call and streaming from Kinesis Video Streams. This parameter is used for testing or for real-time transcription of audio files. This will only work with single-channel files (mono).
- sessionDuration – The maximum duration of the role session (in seconds). You can give a value from 900 seconds (15 minutes) to 3,600 seconds (1 hour). This is the maximum call duration before the customer interaction must be processed.
- Create the stack.
- After the stack is successfully launched, note the following outputs. It could take a few minutes for the stack to launch the solution.
- cloudfrontEndpoint – This is the CloudFront URL you will use to access the agent portal. You can find this in the Output tab for the stack.
- Create a lambda layer by cloning and following the instruction in the repo.
- Docker install is needed to run through the instructions in this step. Please ignore warnings/errors that show up while running the commands in the repo.
- This step will create the lambda layer that you can add to your Lambda Function in your AWS account.
- Add the layer to the lambda function named mlcc-Transcribe-Polly.
- Use the “Specify arn” option to choose the layer you deployed in step 8.
- Log in to the Amazon Connect instance using the admin user you created while launching the instance.
- Select the telephone number you claimed and assign the contact flow “mlccKvsStream,” as shown in Figure 2.
- Use the CloudFront URL noted in Section 1, step 4a to access the agent CCP.
- Call the claimed telephone number and choose the language.
- As you speak, note the agent CCP update with transcribed text in English.
- As an agent, type the response and select Submit.
- Observe the text translated to the language you chose and delivered back to you on the call using voice.
This walkthrough deploys a multilingual contact center that allows callers to talk in their preferred language without requiring the state to hire agents to cover all the languages.